Description

[0001] The U.S. Government has certain rights in this invention. The development of this invention was partially funded
by the United States Government under a HATCH grant from the United States Department of Agriculture, partially
funded by the United States Government with Formula 1433 funds from the United States Department of Agriculture
and partially funded by the United States Government under contract DAAD 19-02018 awarded by the Army.

FIELD OF THE INVENTION 

[0002] The present invention relates generally to cell-specific gene regulation in transgenic animals. Animals may be
made transgenic through administration of a transposon-based vector through any method of administration including
pronuclear injection, or intraembryonic, intratesticular, intraoviductal or intravenous administration. These transgenic
animals contain the gene of interest in all cells, including germ cells. Animals may also be made transgenic by targeting
specific cells for uptake and gene incorporation of the transposon-based vectors. Stable incorporation of a gene of
interest into cells of the transgenic animals is demonstrated by expression of the gene of interest in a cell wherein
expression is regulated by a promoter sequence. The promoter sequence may be provided as a transgene along with
the gene of interest or may be endogenous to the cell. The promoter sequence may be constitutive or inducible, wherein
inducible promoters include tissue-specific promoters, developmentally regulated promoters and chemically inducible
promoters.

BACKGROUND OF THE INVENTION 

[0003] Transgenic animals are desirable for a variety of reasons, including their potential as biological factories to
produce desired molecules for pharmaceutical, diagnostic and industrial uses. This potential is attractive to the industry
due to the inadequate capacity in facilities used for recombinant production of desired molecules and the increasing
demand by the pharmaceutical industry for use of these facilities. Numerous attempts to produce transgenic animals
have met several problems, including low rates of gene incorporation and unstable gene incorporation. Accordingly,
improved gene technologies are needed for the development of transgenic animals for the production of desired
molecules.

[0004] Improved gene delivery technologies are also needed for the treatment of disease in animals and humans.
Many diseases and conditions can be treated with gene-delivery technologies, which provide a gene of interest to a
patient suffering from the disease or the condition. An example of such disease is Type 1 diabetes. Type 1 diabetes is
an autoimmune disease that ultimately results in destruction of the insulin producing β-cells in the pancreas. Although
patients with Type 1 diabetes may be treated adequately with insulin injections or insulin pumps, these therapies are
only partially effective. Insulin replacement, such as via insulin injection or pump administration, cannot fully reverse the
defect in the vascular endothelium found in the hyperglycemic state (Pieper et al., 1998. Diabetes Res. Clin. Pract.
Suppl. S157-S162). In addition, hyper- and hypoglycemia occur frequently despite intensive home blood glucose
monitoring. Finally, careful dietary constraints are needed to maintain an adequate ratio of calories consumed.
This often causes major psychosocial stress for many diabetic patients. Development of gene therapies providing delivery
of the insulin gene into the pancreas of diabetic patients could overcome many of these problems and result in improved
life expectancy and quality of life.

[0005] Several of the prior art gene delivery technologies employed viruses that are associated with potentially
undesirable side effects and safety concerns. The majority of current gene-delivery technologies useful for gene therapy rely
on virus-based delivery vectors, such as adeno and adeno-associated viruses, retroviruses, and other viruses, which
have been attenuated to no longer replicate. (Kay, M.A., et al. 2001. Nature Medicine 7:33-40).

[0006] There are multiple problems associated with the use of viral vectors. First, they are not tissue-specific. In fact,
a gene therapy trial using adenovirus was recently halted because the vector was present in the patient's sperm (Gene
trial to proceed despite fears that therapy could change child's genetic makeup. The New York Times, December 23,
2001). Second, viral vectors are likely to be transiently incorporated, which necessitates re-treating a patient at specified
time intervals. (Kay, M.A., et al. 2001. Nature Medicine 7:33-40). Third, there is a concern that a viral-based vector could
revert to its virulent form and cause disease. Fourth, viral-based vectors require a dividing cell for stable integration.
Fifth, viral-based vectors indiscriminately integrate into various cells and tissues, which can result in undesirable germline
integration. Sixth, the required high titers needed to achieve the desired effect have resulted in the death of one patient
and they are believed to be responsible for induction of cancer in a separate study. (Science, News of the Week, October
4, 2002).

[0007] Accordingly, what is needed is a new vector to produce transgenic animals and humans with stably incorporated
genes, which vector does not cause disease or other unwanted side effects. There is also a need for DNA constructs
that would be stably incorporated into the tissues and cells of animals and humans, including cells in the resting state,
which are not replicating. There is a further recognized need in the art for DNA constructs capable of delivering genes
to specific tissues and cells of animals and humans.

[0008] When incorporating a gene of interest in an animal for the production of a desired protein or when incorporating
a gene of interest in an animal or human for the treatment of a disease, it is often desirable to selectively activate
incorporated genes using inducible promoters. These inducible promoters are regulated by substances either produced
or recognized by the transcription control elements within the cell in which the gene is incorporated. In many instances,
control of gene expression is desired in transgenic animals or humans so that incorporated genes are selectively activated
at desired times and/or under the influence of specific substances. Accordingly, what is needed is a means to selectively
activate genes introduced into the genome of cells of a transgenic animal or human. This can be taken a step further to
cause incorporation to be tissue-specific, which prevents widespread gene incorporation throughout a patient's body
(animal or human). This decreases the amount of DNA needed for a treatment, decreases the chance of incorporation
in gametes, and targets gene delivery, incorporation, and expression to the desired tissue where the gene is needed to
function.

[0009] US-A-5,719,055 discloses a transposon-based vector for the integration of DNA into a host genome.
[0010] US-B1-6,218,185 relates to nucleic acid and amino acid sequences for transformation constructs containing
specific transposable elements.

[0011] Kozak et al. (Journal of Mol. Biol., vol. 196, 1987, pp. 947-950) discloses that sequences flanking the AUG
initiator codon influence its recognition by eukaryotic ribosomes. At least six nucleotides preceding the AUG initiator
codon enhance translation in mammalian cells.

SUMMARY OF THE INVENTION 

[0012] The present invention addresses the problems described above by providing a vector and the use of this vector 
for producing transgenic animals. 

[0013] Transgenic animals include all egg-laying animals and milk-producing animals. Transgenic animals further
include but are not limited to avians, fish, amphibians, reptiles, insects, mammals and humans. In a preferred embodiment,
the animal is an avian animal. In another preferred embodiment, the animal is a milk-producing animal including but
not limited to bovine, porcine, ovine and equine animals. Animals are made transgenic through administration of a
composition comprising a transposon-based vector designed for stable incorporation of a gene of interest for production
of a desired protein, together with an acceptable carrier. A transfection reagent is optionally added to the composition
before administration.

[0014] The transposon-based vectors of the present invention include a) a modified transposase gene operably linked
to a first promoter, wherein the nucleic acid sequence 3' to the first promoter comprises the sequence as set forth in
SEQ ID NO: 13, wherein SEQ ID NO: 13 contains the Kozak sequence and a start codon for the transposase, and
wherein at least one of the first twenty codons of the transposase gene is modified from the wild-type sequence by
changing a nucleotide at a third base position of the codon to an adenine or thymine without modifying the amino acid
encoded by the codon, and b) one or more genes of interest operably-linked to one or more additional promoters, and
wherein the one or more genes of interest and their operably-linked promoters are flanked by transposase insertion
sequences recognized by the transposase encoded by the modified transposase gene, wherein the promoter which is
operably linked to the gene of interest is selected from the group consisting of an ovalbumin promoter, a conalbumin
promoter, a vitellogenin promoter or an ovomucoid promoter. The transposon-based vector also includes the following
characteristics: a) one or more modified Kozak sequences comprising ACCATG (SEQ ID NO: 13) at the 3' end of the
first promoter to enhance expression of the transposase; b) modifications of the codons for the first several N-terminal
amino acids of the transposase, wherein the nucleotide at the third base position of each codon was changed to an A
or a T without changing the corresponding amino acid; c) addition of one or more stop codons to enhance the termination
of transposase synthesis; and/or, d) addition of an effective polyA sequence operably-linked to the transposase to further
enhance expression of the transposase gene.
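
The third-base codon modification described above can be illustrated with a minimal Python sketch. It assumes the standard genetic code and uses a short made-up coding sequence, not the actual transposase sequence; the function names `translate` and `at_enrich_third_bases` are hypothetical and chosen only for this illustration. For each of the first twenty codons, the sketch swaps a G or C at the third position for an A or a T only when the encoded amino acid stays the same.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Kozak sequence plus start codon, as quoted in the text (SEQ ID NO: 13).
KOZAK_WITH_START = "ACCATG"


def translate(cds):
    """Translate a coding sequence, ignoring any trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def at_enrich_third_bases(cds, n_codons=20):
    """Change third bases to A or T in the first n_codons codons.

    A codon is changed only if a synonymous codon ending in A or T exists,
    so the encoded protein is unchanged.
    """
    cds = cds.upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    for idx, codon in enumerate(codons[:n_codons]):
        if len(codon) < 3 or codon[2] in "AT":
            continue  # partial codon, or already ends in A/T
        for base in "AT":
            candidate = codon[:2] + base
            if CODON_TABLE[candidate] == CODON_TABLE[codon]:
                codons[idx] = candidate
                break
    return "".join(codons)


# Hypothetical example sequence (Met-Leu-Ala-Trp), for illustration only.
original = "ATGCTGGCCTGG"
modified = at_enrich_third_bases(original)
# ATG and TGG have no synonymous codon ending in A or T, so they stay as-is.
with_kozak = KOZAK_WITH_START + modified[3:]  # place ACCATG at the start
```

In this toy case `modified` is `ATGCTAGCATGG`, and both sequences translate to the same peptide, which is the property the text requires of the modified transposase codons.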

[0015] Use of the vectors and compositions, respectively, of the present invention results in highly efficient and stable
incorporation of a gene of interest into the genome of transfected animals. For example, transgenic avians have been
mated and produce transgenic progeny in the G1 generation. The transgenic progeny have been mated and produce
transgenic progeny in the G2 generation.

[0016] The present invention also provides for tissue-specific incorporation and/or expression of a gene of interest.
Tissue-specific incorporation of a gene of interest may be achieved by placing the transposase gene under the control
of a tissue-specific promoter, whereas tissue-specific expression of a gene of interest may be achieved by placing the
gene of interest under the control of a tissue-specific promoter. According to the invention the promoter which is operably
linked to the gene of interest is selected from the group consisting of an ovalbumin promoter, a conalbumin promoter,
a vitellogenin promoter or an ovomucoid promoter. In some embodiments, the gene of interest is transcribed under the
influence of an ovalbumin, or other oviduct specific, promoter. Linking the gene of interest to an oviduct specific promoter
in an egg-laying animal results in synthesis of a desired molecule and deposition of the desired molecule in a developing
egg. The present invention further provides for stable incorporation and expression of genes in the epithelial cells of the
mammary gland in milk-producing animals. Transcription of the gene of interest in the epithelial cells of the mammary
gland results in synthesis of a desired molecule and deposition of the desired molecule in the milk. A preferred molecule
is a protein. In some embodiments, the desired molecule deposited in the milk is an antiviral protein, an antibody, or a
serum protein.

[0017] In other embodiments, specific incorporation of the proinsulin gene into liver cells of a diabetic animal results
in the improvement of the animal's condition. Such improvement is achieved by placing a transposase gene under the
control of a liver-specific promoter, which drives integration of the gene of interest in liver cells of the diabetic animal.
[0018] The present invention advantageously produces a high number of transgenic animals having a gene of interest
stably incorporated. These transgenic animals successfully pass the desired gene to their progeny. The transgenic
animals of the present invention also produce large amounts of a desired molecule encoded by the transgene. Transgenic
egg-laying animals, particularly avians, produce large amounts of a desired protein that is deposited in the egg for rapid
harvest and purification. Transgenic milk-producing animals produce large amounts of a desired protein that is deposited
in the milk for rapid harvest and purification.

[0019] Any desired gene may be incorporated into the novel transposon-based vectors of the present invention in
order to synthesize a desired molecule in the transgenic animals. Proteins, peptides and nucleic acids are preferred
desired molecules to be produced by the transgenic animals of the present invention. Particularly preferred proteins are
antibody proteins.

[0020] This invention provides a composition useful for the production of transgenic hens capable of producing
substantially high amounts of a desired protein or peptide. Entire flocks of transgenic birds may be developed very quickly
in order to produce industrial amounts of desired molecules. The present invention solves the problems inherent in the
inadequate capacity of fermentation facilities used for bacterial production of molecules and provides a more efficient
and economical way to produce desired molecules. Accordingly, the present invention provides a means to produce
large amounts of therapeutic, diagnostic and reagent molecules.

[0021] Transgenic chickens are excellent in terms of convenience and efficiency of manufacturing molecules such as
proteins and peptides. Starting with a single transgenic rooster, thousands of transgenic offspring can be produced
within a year. (In principle, up to forty million offspring could be produced in just three generations). Each transgenic
female is expected to lay at least 250 eggs/year, each potentially containing hundreds of milligrams of the selected
protein. Flocks of chickens numbering in the hundreds of thousands are readily handled through established commercial
systems. The technologies for obtaining eggs and fractionating them are also well known and widely accepted. Thus,
for each therapeutic, diagnostic, or other protein of interest, large amounts of a substantially pure material can be
produced at relatively low incremental cost.

[0022] It is another object of the present invention to provide novel transposon-based vectors that encode for the
production of desired proteins or peptides in cells.

[0023] It is an object of the present invention to produce transgenic animals through administration of a
transposon-based vector.

[0024] Another object of the present invention is to produce transgenic animals through administration of a
transposon-based vector, wherein the transgenic animals produce desired proteins or peptides.

[0025] Yet another object of the present invention is to produce transgenic animals through administration of a
transposon-based vector, wherein the transgenic animals produce desired proteins or peptides and deposit the proteins or
peptides in eggs or milk.

[0026] It is a further object of the present invention to produce transgenic animals through intraembryonic, intratesticular
or intraoviductal administration of a transposon-based vector.

[0027] It is a further object of the present invention to provide the use of a vector according to the invention for producing
a transgenic animal which is capable of producing transgenic progeny.

[0028] Yet another object of the present invention relates to the use of a vector according to the invention to produce
transgenic animals through administration of a vector according to the invention that are capable of producing a desired
molecule, such as a protein, peptide or nucleic acid.

[0029] Another object of the present invention relates to the use of a vector according to the invention to produce
transgenic animals through administration of a vector according to the invention, wherein such administration results in
modulation of endogenous gene expression.

[0030] It is another object of the present invention to provide transposon-vectors useful for cell- or tissue-specific
expression of a gene of interest in an animal or human with the purpose of gene therapy.

[0031] Yet another object of the present invention relates to the use of a vector according to the invention to produce
transgenic avians through administration of a vector according to the invention that are capable of producing proteins,
peptides or nucleic acids.

[0032] It is another object of the present invention to produce transgenic animals through administration of a
transposon-based vector encoding an antibody or a fragment thereof.

[0033] A wide range of recombinant peptides and proteins can be produced in transgenic egg-laying animals and
milk-producing animals. Enzymes, hormones, antibodies, growth factors, serum proteins, commodity proteins, biological
response modifiers, peptides and designed proteins may all be made through practice of the present invention. For
example, rough estimates suggest that it is possible to produce in bulk growth hormone, insulin, or Factor VIII, and
deposit them in transgenic egg whites, for an incremental cost in the order of one dollar per gram. At such prices it is
feasible to consider administering such medical agents by inhalation or even orally, instead of through injection. Even
if bioavailability rates through these avenues were low, the cost of a much higher effective-dose would not be prohibitive.
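
The production figures quoted in this summary (at least 250 eggs per hen per year, hundreds of milligrams of protein per egg, flocks in the hundreds of thousands, and an incremental cost in the order of one dollar per gram) can be combined into a rough back-of-the-envelope estimate. The Python sketch below is only illustrative: the specific inputs (100,000 hens, 100 mg per egg) are assumed values at the low end of the ranges in the text, the function names are hypothetical, and recovery losses during purification are ignored.

```python
def annual_protein_kg(hens, eggs_per_hen_per_year, mg_protein_per_egg):
    """Gross protein deposited in eggs per year, in kilograms."""
    total_mg = hens * eggs_per_hen_per_year * mg_protein_per_egg
    return total_mg / 1_000_000  # 1 kg = 1,000,000 mg


def incremental_cost_usd(kg_protein, usd_per_gram):
    """Incremental cost at a given price per gram."""
    return kg_protein * 1000 * usd_per_gram


# Assumed inputs, chosen from the low end of the ranges quoted in the text.
flock_size = 100_000        # "hundreds of thousands" of chickens
eggs_per_year = 250         # "at least 250 eggs/year"
protein_per_egg_mg = 100    # "hundreds of milligrams"

yield_kg = annual_protein_kg(flock_size, eggs_per_year, protein_per_egg_mg)
cost = incremental_cost_usd(yield_kg, usd_per_gram=1.0)

print(f"Estimated gross yield: {yield_kg:,.0f} kg/year")
print(f"Incremental cost at $1/g: ${cost:,.0f}")
```

Under these assumptions the flock deposits about 2,500 kg of protein per year in its eggs; higher per-egg amounts or larger flocks scale the figure linearly.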
[0034] In one embodiment, the egg-laying transgenic animal is an avian. The use of a vector according to the invention
may be done with respect to avians including Ratites, Psittaciformes, Falconiformes, Piciformes, Strigiformes,
Passeriformes, Coraciformes, Ralliformes, Cuculiformes, Columbiformes, Galliformes, Anseriformes, and Herodiones.
Preferably, the egg-laying transgenic animal is a poultry bird. More preferably, the bird is a chicken, turkey, duck, goose or
quail. Another preferred bird is a ratite, such as an emu, an ostrich, a rhea, or a cassowary. Other preferred birds are
partridge, pheasant, kiwi, parrot, parakeet, macaw, falcon, eagle, hawk, pigeon, cockatoo, song birds, jay bird, blackbird,
finch, warbler, canary, toucan, mynah, or sparrow.

[0035] In another embodiment, the transgenic animal is a milk-producing animal, including but not limited to bovine,
ovine, porcine, equine, and primate animals. Milk-producing animals include but are not limited to cows, goats, horses,
pigs, buffalo, rabbits, non-human primates, and humans.

[0036] Accordingly, it is an object of the present invention to provide novel transposon-based vectors. Still another
object of the present invention relates to the use of a vector according to the invention to produce transgenic avians
through administration of a vector according to the invention that are capable of producing proteins or peptides and
depositing these proteins or peptides in the egg.

[0037] Another object of the present invention is to provide transgenic avians that contain a stably incorporated trans- 
gene. 

[0038] Still another object of the present invention is to provide eggs containing desired proteins or peptides encoded 
by a transgene incorporated into the transgenic avian that produces the egg. 

[0039] A further object of the present invention relates to the use of a vector according to the invention to produce 
transgenic milk-producing animals through administration of a vector according to the invention that are capable of 
producing proteins, peptides or nucleic acids. 

[0040] Still another object of the present invention relates to the use of a vector according to the invention to produce
transgenic milk-producing animals through administration of a vector according to the invention that are capable of 
producing proteins or peptides and depositing these proteins or peptides in their milk. 

[0041] Another object of the present invention is to provide transgenic milk-producing animals that contain a stably 
incorporated transgene. 

[0042] Another object of the present invention is to provide transgenic milk-producing animals that are capable of
producing proteins or peptides and depositing these proteins or peptides in their milk.

[0043] Yet another object of the present invention is to provide milk containing desired molecules encoded by a 
transgene incorporated into the transgenic milk-producing animals that produce the milk. 

[0044] Still another object of the present invention is to provide milk containing desired proteins or peptides encoded 
by a transgene incorporated into the transgenic milk-producing animals that produce the milk. 

[0045] A further object of the present invention relates to the use of a vector according to the invention to produce 
transgenic sperm through administration of a vector according to the invention to an animal. 

[0046] A further object of the present invention is to provide transgenic sperm that contain a stably incorporated transgene.
[0047] An advantage of the present invention is that transgenic animals are produced with higher efficiencies than 
observed in the prior art. 

[0048] Another advantage of the present invention is that these transgenic animals possess high copy numbers of
the transgene.

[0049] Another advantage of the present invention is that the transgenic animals produce large amounts of desired 
molecules encoded by the transgene. 

[0050] Still another advantage of the present invention is that desired molecules are produced by the transgenic 
animals much more efficiently and economically than prior art methods, thereby providing a means for large scale 
production of desired molecules, particularly proteins and peptides. 

[0051] These and other objects, features and advantages of the present invention will become apparent after a review
of the following detailed description of the disclosed embodiments and claims.

BRIEF DESCRIPTION OF THE FIGURES 

[0052]
Figure 1 depicts schematically a transposon-based vector containing a transposase operably linked to a first promoter
and a gene of interest operably-linked to a second promoter, wherein the gene of interest and its operably-linked
promoter are flanked by insertion sequences (IS) recognized by the transposase. "Pro" designates a promoter. In
this and subsequent figures, the size of the actual nucleotide sequence is not necessarily proportionate to the box
representing that sequence.

Figure 2 depicts schematically a transposon-based vector for targeting deposition of a polypeptide in an egg white 
wherein Ov pro is the ovalbumin promoter, Ov protein is the ovalbumin protein and PolyA is a polyadenylation 
sequence. The TAG sequence includes a spacer, the gp41 hairpin loop from HIV I and a protein cleavage site. 

Figure 3 depicts schematically a transposon-based vector for targeting deposition of a polypeptide in an egg white
wherein Ovo pro is the ovomucoid promoter and Ovo SS is the ovomucoid signal sequence. The TAG sequence
includes a spacer, the gp41 hairpin loop from HIV I and a protein cleavage site.

Figure 4 depicts schematically a transposon-based vector for targeting deposition of a polypeptide in an egg yolk
wherein Vit pro is the vitellogenin promoter and Vit targ is the vitellogenin targeting sequence.

Figure 5 depicts schematically a transposon-based vector for expression of antibody heavy and light chains. Prepro
indicates a prepro sequence from cecropin and pro indicates a pro sequence from cecropin.

Figure 6 depicts schematically a transposon-based vector for expression of antibody heavy and light chains. Ent
indicates an enterokinase cleavage sequence.

Figure 7 depicts schematically egg white targeted expression of antibody heavy and light chains from one vector in
either tail-to-tail (Figure 7A) or tail-to-head (Figure 7B) configuration. In the tail-to-tail configuration, the ovalbumin
signal sequence adjacent to the gene for the light chain contains on its 3' end an enterokinase cleavage site (not
shown) to allow cleavage of the signal sequence from the light chain, and the ovalbumin signal sequence adjacent
to the gene for the heavy chain contains on its 5' end an enterokinase cleavage site (not shown) to allow cleavage
of the signal sequence from the heavy chain. In the tail-to-head configuration, the ovalbumin signal sequence
adjacent to the gene for the heavy chain and the light chain contains on its 3' end an enterokinase cleavage site
(not shown) to allow cleavage of the signal sequence from the heavy or light chain.
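
As an informal aid to reading Figure 1, the general vector layout (a transposase under a first promoter, followed by one or more promoter-plus-gene cassettes flanked by insertion sequences) can be modelled as a small data structure. The Python sketch below is a hypothetical illustration only: the class and field names are invented for this example, the check simply mirrors the promoter group named in the summary, and it says nothing about actual sequences.

```python
from dataclasses import dataclass, field
from typing import List

# Promoters the summary names for the gene of interest.
ALLOWED_GOI_PROMOTERS = {"ovalbumin", "conalbumin", "vitellogenin", "ovomucoid"}


@dataclass
class Cassette:
    promoter: str  # regulatory sequence operably linked to the gene
    gene: str


@dataclass
class TransposonVector:
    transposase: Cassette
    genes_of_interest: List[Cassette] = field(default_factory=list)
    flanked_by_insertion_sequences: bool = True

    def layout(self):
        """Return the element order sketched in Figure 1 as labels."""
        elements = [f"Pro:{self.transposase.promoter}", self.transposase.gene]
        if self.flanked_by_insertion_sequences:
            elements.append("IS")
        for cassette in self.genes_of_interest:
            elements += [f"Pro:{cassette.promoter}", cassette.gene]
        if self.flanked_by_insertion_sequences:
            elements.append("IS")
        return elements

    def problems(self):
        """List ways this model departs from the layout described in the text."""
        found = []
        if not self.genes_of_interest:
            found.append("no gene of interest")
        if not self.flanked_by_insertion_sequences:
            found.append("gene of interest not flanked by insertion sequences")
        for cassette in self.genes_of_interest:
            if cassette.promoter not in ALLOWED_GOI_PROMOTERS:
                found.append(f"promoter '{cassette.promoter}' not in the named group")
        return found


# Placeholder names; no real promoter or gene identity is implied.
vector = TransposonVector(
    transposase=Cassette(promoter="first promoter", gene="transposase"),
    genes_of_interest=[Cassette(promoter="ovalbumin", gene="gene of interest")],
)
```

Calling `vector.layout()` lists the elements in the order drawn in Figure 1, and `vector.problems()` returns an empty list when the model matches the described arrangement.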

DETAILED DESCRIPTION OF THE INVENTION 

[0053] The present invention provides the use of a vector according to the invention for producing transgenic animals,
particularly egg-laying animals and milk-producing animals, through administration of a composition comprising a vector
according to the invention designed for stable incorporation of a gene of interest for production of a desired molecule.

Definitions 

[0054] It is to be understood that as used in the specification and in the claims, "a" or "an" can mean one or more,
depending upon the context in which it is used. Thus, for example, reference to "a cell" can mean that at least one cell
can be utilized.

[0055] The term "antibody" is used interchangeably with the term "immunoglobulin" and is defined herein as a protein
synthesized by an animal or a cell of the immune system in response to the presence of a foreign substance commonly
referred to as an "antigen" or an "immunogen". The term antibody includes fragments of antibodies. Antibodies are
characterized by specific affinity to a site on the antigen, wherein the site is referred to as an "antigenic determinant" or an
"epitope". Antigens can be naturally occurring or artificially engineered. Artificially engineered antigens include but are
not limited to small molecules, such as small peptides, attached to haptens such as macromolecules, for example
proteins, nucleic acids, or polysaccharides. Artificially designed or engineered variants of naturally occurring antibodies
and artificially designed or engineered antibodies not occurring in nature are all included in the current definition. Such
variants include conservatively substituted amino acids and other forms of substitution as described in the section
concerning proteins and polypeptides.

[0056] As used herein, the term "egg-laying animal" includes all amniotes such as birds, turtles, lizards and
monotremes. Monotremes are egg-laying mammals and include the platypus and echidna. The term "bird" or "fowl," as
used herein, is defined as a member of the Aves class of animals which are characterized as warm-blooded, egg-laying
vertebrates primarily adapted for flying. Avians include, without limitation, Ratites, Psittaciformes, Falconiformes,
Piciformes, Strigiformes, Passeriformes, Coraciformes, Ralliformes, Cuculiformes, Columbiformes, Galliformes,
Anseriformes, and Herodiones. The term "Ratite," as used herein, is defined as a group of flightless, mostly large, running
birds comprising several orders and including the emus, ostriches, kiwis, and cassowaries. The term "Psittaciformes",
as used herein, includes parrots and refers to a monofamilial order of birds that exhibit zygodactylism and have a strong
hooked bill. A "parrot" is defined as any member of the avian family Psittacidae (the single family of the Psittaciformes),
distinguished by the short, stout, strongly hooked beak. The term "chicken" as used herein denotes chickens used for
table egg production, such as egg-type chickens, chickens reared for public meat consumption, or broilers, and chickens
reared for both egg and meat production ("dual-purpose" chickens). The term "chicken" also denotes chickens produced
by primary breeder companies, or chickens that are the parents, grandparents, great-grandparents, etc. of those chickens
reared for public table egg, meat, or table egg and meat consumption.

[0057] The term "egg" is defined herein as a large female sex cell enclosed in a porous, calcareous or leathery shell,
produced by birds and reptiles. The term "ovum" is defined as a female gamete, and is also known as an egg. Therefore,
egg production in all animals other than birds and reptiles, as used herein, is defined as the production and discharge
of an ovum from an ovary, or "ovulation". Accordingly, it is to be understood that the term "egg" as used herein is defined
as a large female sex cell enclosed in a porous, calcareous or leathery shell, when a bird or reptile produces it, or it is an
ovum when it is produced by all other animals.

[0058] The term "milk-producing animal" refers herein to mammals including, but not limited to, bovine, ovine, porcine,
equine, and primate animals. Milk-producing animals include but are not limited to cows, llamas, camels, goats, reindeer,
zebu, water buffalo, yak, horses, pigs, rabbits, non-human primates, and humans.

[0059] The term "gene" is defined herein to include a coding region for a protein, peptide or polypeptide.

[0060] The term "vector" is used interchangeably with the terms "construct", "DNA construct" and "genetic construct"
to denote synthetic nucleotide sequences used for manipulation of genetic material, including but not limited to cloning,
subcloning, sequencing, or introduction of exogenous genetic material into cells, tissues or organisms, such as birds. It
is understood by one skilled in the art that vectors may contain synthetic DNA sequences, naturally occurring DNA
sequences, or both. The vectors of the present invention are transposon-based vectors as described herein.

[0061] When referring to two nucleotide sequences, one being a regulatory sequence, the term "operably-linked" is
defined herein to mean that the two sequences are associated in a manner that allows the regulatory sequence to affect
expression of the other nucleotide sequence. It is not required that the operably-linked sequences be directly adjacent
to one another with no intervening sequence(s).

[0062] The term "regulatory sequence" is defined herein as including promoters, enhancers and other expression
control elements such as polyadenylation sequences, matrix attachment sites, insulator regions for expression of multiple
genes on a single construct, ribosome entry/attachment sites, introns that are able to enhance expression, and silencers.

Transposon-Based Vectors

[0063] While not wanting to be bound by the following statement, it is believed that the nature of the DNA construct
is an important factor in successfully producing transgenic animals. The "standard" types of plasmid and viral vectors
that have previously been almost universally used for transgenic work in all species, especially avians, have low
efficiencies and may constitute a major reason for the low rates of transformation previously observed. The DNA (or RNA)
constructs previously used often do not integrate into the host DNA, or integrate only at low frequencies. Other factors
may have also played a part, such as poor entry of the vector into target cells. The present invention provides
transposon-based vectors that can be administered to an animal that overcome the prior art problems relating to low transgene
integration frequencies. Two preferred transposon-based vectors of the present invention in which a transposase, gene
of interest and other polynucleotide sequences may be introduced are termed pTnMCS (SEQ ID NO:38) and pTnMod
(SEQ ID NO:1).

[0064] The transposon-based vectors of the present invention produce integration frequencies an order of magnitude
greater than has been achieved with previous vectors. More specifically, intratesticular injections performed with a prior
art transposon-based vector (described in U.S. Patent No. 5,719,055) resulted in 41% sperm positive roosters whereas
intratesticular injections performed with the novel transposon-based vectors of the present invention resulted in 77%
sperm positive roosters. Actual frequencies of integration were estimated by either or both comparative strength of the
PCR signal from the sperm and histological evaluation of the testes and sperm by quantitative PCR.

[0065] The transposon-based vectors of the present invention include a) a modified transposase gene operably linked
to a first promoter, wherein the nucleic acid sequence 3' to the first promoter comprises the sequence as set forth in
SEQ ID NO:13, wherein SEQ ID NO:13 contains the Kozak sequence and a start codon for the transposase, and
wherein at least one of the first twenty codons of the transposase gene is modified from the wild-type sequence by
changing a nucleotide at a third base position of the codon to an adenine or thymine without modifying the amino acid
encoded by the codon, and b) one or more genes of interest operably-linked to one or more additional promoters, and
wherein the one or more genes of interest and their operably-linked promoters are flanked by transposase insertion
sequences recognized by the transposase encoded by the modified transposase gene, wherein the promoter which is
operably linked to the gene of interest is selected from the group consisting of an ovalbumin promoter, a conalbumin
promoter, a vitellogenin promoter or an ovomucoid promoter. The transposon-based vector also includes one or more
of the following characteristics: a) one or more modified Kozak sequences comprising ACCATG (SEQ ID NO:13) at the
3' end of the first promoter to enhance expression of the transposase; b) modifications of the codons for the first several
N-terminal amino acids of the transposase, wherein the third base of each codon was changed to an A or a T without
changing the corresponding amino acid; c) addition of one or more stop codons to enhance the termination of transposase
synthesis; and d) addition of an effective polyA sequence operably-linked to the transposase to further enhance
expression of the transposase gene. Figure 1 shows a schematic representation of several components of the
transposon-based vector. The present invention further includes vectors containing more than one gene of interest, wherein
a second or subsequent gene of interest is operably-linked to the second promoter or to a different promoter. It is also to be
understood that the transposon-based vectors shown in the Figures are representational of the present invention and
that the order of the vector elements may be different than that shown in the Figures, that the elements may be present
in various orientations, and that the vectors may contain additional elements not shown in the Figures.

[0066] In a further embodiment of the present invention, the transposase found in the transposase-based vector is an
altered target site (ATS) transposase and the insertion sequences are those recognized by the ATS transposase.
However, the transposase located in the transposase-based vectors is not limited to a modified ATS transposase and
can be derived from any transposase. Transposases known in the prior art include those found in AC7, Tn5SEQ1,
Tn916, Tn951, Tn1721, Tn2410, Tn1681, Tn1, Tn2, Tn3, Tn4, Tn5, Tn6, Tn9, Tn10, Tn30, Tn101, Tn903, Tn501,
Tn1000 (γδ), Tn1681, Tn2901, AC transposons, Mp transposons, Spm transposons, En transposons, Dotted transposons,
Mu transposons, Ds transposons, dSpm transposons and I transposons. According to the present invention, these
transposases and their regulatory sequences are modified for improved functioning as follows: a) the addition of one or
more modified Kozak sequences comprising ACCATG (SEQ ID NO:13) at the 3' end of the promoter operably-linked to
the transposase; b) a change of the codons for the first several amino acids of the transposase, wherein the third base
of each codon was changed to an A or a T without changing the corresponding amino acid; c) the addition of one or
more stop codons to enhance the termination of transposase synthesis; and/or d) the addition of an effective polyA
sequence operably-linked to the transposase to further enhance expression of the transposase gene.
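
For illustration only, modifications a) and c) above can be checked on a candidate sequence in software. The following Python sketch is not part of the disclosed vectors: it locates the ACCATG motif in a DNA string and counts the consecutive in-frame stop codons that close the reading frame opened by that start codon. The function names and the toy sequence are hypothetical, and the sketch assumes the sequence is given as the coding (sense) strand in DNA letters.

```python
# Illustrative sketch only; names and the toy sequence are assumptions.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # DNA forms of UAA, UAG, UGA
KOZAK = "ACCATG"                     # modified Kozak motif ending in the start codon


def find_start(seq):
    """Return the index of the A of ATG in the first ACCATG motif, or -1."""
    pos = seq.upper().find(KOZAK)
    return -1 if pos < 0 else pos + 3


def count_terminal_stops(seq, start):
    """Count consecutive in-frame stop codons ending the frame opened at start."""
    seq = seq.upper()
    count = 0
    for i in range(start, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon in STOP_CODONS:
            count += 1
        elif count:
            # First sense codon after the stop codon run: stop counting.
            break
    return count


if __name__ == "__main__":
    toy = "GGC" + "ACCATG" + "ACAGGA" + "TAATGA" + "AAAA"
    start = find_start(toy)
    print(start, count_terminal_stops(toy, start))
```

In this toy construct the reading frame ends with two adjacent stop codons, which corresponds to the "one or more stop codons" characteristic; a real construct would be checked against its actual sequence listing.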

[0067] Although not wanting to be bound by the following statement, it is believed that the modifications of the first
several N-terminal codons of the transposase gene increase transcription of the transposase gene, in part, by increasing
strand dissociation. It is preferable that between approximately 1 and 20, more preferably 3 and 15, and most preferably
between 4 and 12 of the first N-terminal codons of the transposase are modified such that the third base of each codon
is changed to an A or a T without changing the encoded amino acid. In one embodiment, the first ten N-terminal codons
of the transposase gene are modified in this manner. It is also preferred that the transposase contain mutations that
make it less specific for preferred insertion sites and thus increase the rate of transgene insertion as discussed in U.S.
Patent No. 5,719,055.
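
The codon modification described in this paragraph can be sketched in a few lines of Python. This is a minimal illustration, assuming the standard genetic code, a coding sequence that starts at the ATG, and a preference for A over T when both substitutions are synonymous; none of these choices is specified by the text, and the function names are hypothetical. Codons with no synonymous A- or T-ending alternative (for example ATG and TGG) are left unchanged.

```python
# Illustrative sketch only; assumes the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def at_enrich_third_base(cds, n_codons=10):
    """Change the third base of the first n_codons codons to A or T
    wherever the change leaves the encoded amino acid unchanged."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    out = []
    for idx in range(0, usable, 3):
        codon = cds[idx:idx + 3]
        if idx // 3 < n_codons and codon[2] not in "AT":
            for base in "AT":  # assumption: try A first, then T
                candidate = codon[:2] + base
                if CODON_TABLE[candidate] == CODON_TABLE[codon]:
                    codon = candidate
                    break
        out.append(codon)
    return "".join(out) + cds[usable:]


if __name__ == "__main__":
    original = "ATGACCGGGCTGTGGCCC"
    modified = at_enrich_third_base(original, n_codons=5)
    print(modified, translate(original) == translate(modified))
```

Here the codon count includes the start codon, which is an assumption; the comparison of the two translations confirms that only silent changes were made.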

[0068] In some embodiments, the transposon-based vectors are optimized for expression in a particular host by
changing the methylation patterns of the vector DNA. For example, prokaryotic methylation may be reduced by using a
methylation deficient organism for production of the transposon-based vector. The transposon-based vectors may also
be methylated to resemble eukaryotic DNA for expression in a eukaryotic host.

[0069] Transposases and insertion sequences from other analogous eukaryotic transposon-based vectors that can
also be modified and used are, for example, the Drosophila P element derived vectors disclosed in U.S. Patent No.
6,291,243; the Drosophila mariner element described in Sherman et al. (1998); or the sleeping beauty transposon. See
also Hackett et al. (1999); D. Lampe et al., 1999. Proc. Natl. Acad. Sci. USA, 96:11428-11433; S. Fischer et al., 2001,
Proc. Natl. Acad. Sci. USA, 98:6759-6764; L. Zagoraiou et al., 2001, Proc. Natl. Acad. Sci. USA, 98:11474-11478; and
D. Berg et al. (Eds.), Mobile DNA, Amer. Soc. Microbiol. (Washington, D.C., 1989). However, it should be noted that
bacterial transposon-based elements are preferred, as there is less likelihood that a eukaryotic transposase in the
recipient species will recognize prokaryotic insertion sequences bracketing the transgene.

[0070] Many transposases recognize different insertion sequences, and therefore, it is to be understood that a
transposase-based vector will contain insertion sequences recognized by the particular transposase also found in the
transposase-based vector. In a preferred embodiment of the invention, the insertion sequences have been shortened to about
70 base pairs in length as compared to those found in wild-type transposons that typically contain insertion sequences
of well over 100 base pairs.

[0071] While the examples provided below incorporate a "cut and insert" Tn10 based vector that is destroyed following
the insertion event, the present invention also encompasses the use of a "rolling replication" type transposon-based
vector. Use of a rolling replication type transposon allows multiple copies of the transposon/transgene to be made from
a single transgene construct and the copies inserted. This type of transposon-based system thereby provides for insertion
of multiple copies of a transgene into a single genome. A rolling replication type transposon-based vector may be
preferred when the promoter operably-linked to the gene of interest is endogenous to the host cell and present in a high
copy number or highly expressed. However, use of a rolling replication system may require tight control to limit the
insertion events to non-lethal levels. Tn1, Tn2, Tn3, Tn4, Tn5, Tn9, Tn21, Tn501, Tn551, Tn951, Tn1721, Tn2410 and
Tn2603 are examples of a rolling replication type transposon, although Tn5 could be both a rolling replication and a cut
and insert type transposon.

Stop Codons and PolyA Sequences

[0072] In one embodiment, the transposon-based vector contains two stop codons operably-linked to the transposase
and/or to the gene of interest. In an alternate embodiment, one stop codon of UAA or UGA is operably linked to the
transposase and/or to the gene of interest. As used herein an "effective polyA sequence" refers to either a synthetic or
non-synthetic sequence that contains multiple and sequential nucleotides containing an adenine base (an A
polynucleotide string) and that increases expression of the gene to which it is operably-linked. A polyA sequence may be
operably-linked to any gene in the transposon-based vector including, but not limited to, a transposase gene and a gene of
interest. In one embodiment, a polyA sequence comprises the polynucleotide sequence provided in SEQ ID NO:28. A preferred
polyA sequence is optimized for use in the host animal or human. In one embodiment, the polyA sequence is optimized
for use in a bird, and more specifically, a chicken. The chicken optimized polyA sequence generally contains a minimum
of 60 base pairs, and more preferably between approximately 80 and several hundred base pairs, that precede the A
polynucleotide string and thereby separate the stop codon from the A polynucleotide string. A chicken optimized polyA
sequence may also have a reduced amount of CT repeats as compared to a synthetic polyA sequence. In one embodiment
of the present invention, the polyA sequence comprises a conalbumin polyA sequence as provided in SEQ ID NO:33
and as taken from GenBank accession # Y00407, base pairs 10651-11058.
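
The spacing requirement described above, in which a stretch of sequence separates the stop codon from the A polynucleotide string, can be measured with a short Python sketch. This is a minimal illustration under stated assumptions: the input is the DNA immediately 3' of the stop codon, and a run of at least ten consecutive A residues is treated as the A polynucleotide string. The run-length threshold and the function names are hypothetical; only the 60 base pair minimum comes from the text.

```python
# Illustrative sketch only; min_a_run is an assumed threshold.
import re


def spacer_before_poly_a(region_after_stop, min_a_run=10):
    """Return the number of bases between the stop codon and the first run of
    at least min_a_run A residues, or None if no such run is present."""
    match = re.search("A{%d,}" % min_a_run, region_after_stop.upper())
    return None if match is None else match.start()


def meets_minimum_spacer(region_after_stop, min_spacer=60, min_a_run=10):
    """True if an A string exists and is preceded by at least min_spacer bases."""
    spacer = spacer_before_poly_a(region_after_stop, min_a_run)
    return spacer is not None and spacer >= min_spacer


if __name__ == "__main__":
    toy_region = "GC" * 40 + "A" * 20  # 80 bp spacer followed by an A string
    print(spacer_before_poly_a(toy_region), meets_minimum_spacer(toy_region))
```

Raising `min_spacer` to 80 would correspond to the "more preferably" range given in the paragraph.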

Promoters and Enhancers 

[0073] The first promoter operably-linked to the transposase gene can be a constitutive promoter or an inducible
promoter. The second promoter operably-linked to the gene of interest is selected from the group consisting of an
ovalbumin promoter, a conalbumin promoter, a vitellogenin promoter or an ovomucoid promoter. Constitutive promoters
include, but are not limited to, immediate early cytomegalovirus (CMV) promoter, herpes simplex virus 1 (HSV1)
immediate early promoter, SV40 promoter, lysozyme promoter, early and late CMV promoters, early and late HSV promoters,
β-actin promoter, tubulin promoter, Rous-Sarcoma virus (RSV) promoter, and heat-shock protein (HSP) promoter.
Inducible promoters include tissue-specific promoters, developmentally-regulated promoters and chemically inducible
promoters. Examples of tissue-specific promoters include the glucose 6 phosphate (G6P) promoter, vitellogenin
promoter, ovalbumin promoter, ovomucoid promoter, conalbumin promoter, ovotransferrin promoter, prolactin promoter,
kidney uromodulin promoter, and placental lactogen promoter. In one embodiment, the vitellogenin promoter includes
a polynucleotide sequence of SEQ ID NO:17. The G6P promoter sequence may be deduced from a rat G6P gene
untranslated upstream region provided in GenBank Accession number U57552.1. Examples of
developmentally-regulated promoters include the homeobox promoters and several hormone induced promoters. Examples of
chemically inducible promoters include reproductive hormone induced promoters and antibiotic inducible promoters such as
the tetracycline inducible promoter and the zinc-inducible metallothionein promoter.

[0074] Other inducible promoter systems include the Lac operator repressor system inducible by IPTG (isopropyl
beta-D-thiogalactoside) (Cronin, A. et al. 2001. Genes and Development, v. 15), ecdysone-based inducible systems (Hoppe,
U. C. et al. 2000. Mol. Ther. 1:159-164); estrogen-based inducible systems (Braselmann, S. et al. 1993. Proc. Natl.
Acad. Sci. 90:1657-1661); progesterone-based inducible systems using a chimeric regulator, GLVP, which is a hybrid
protein consisting of the GAL4 binding domain and the herpes simplex virus transcriptional activation domain, VP16,
and a truncated form of the human progesterone receptor that retains the ability to bind ligand and can be turned on by
RU486 (Wang, et al. 1994. Proc. Natl. Acad. Sci. 91:8180-8184); CID-based inducible systems using chemical inducers
of dimerization (CIDs) to regulate gene expression, such as a system wherein rapamycin induces dimerization of the
cellular proteins FKBP12 and FRAP (Belshaw, P. J. et al. 1996. J. Chem. Biol. 3:731-738; Fan, L. et al. 1999. Hum.
Gene Ther. 10:2273-2285; Shariat, S.F. et al. 2001. Cancer Res. 61:2562-2571; Spencer, D.M. 1996. Curr. Biol.
6:839-847). Chemical substances that activate the chemically inducible promoters can be administered to the animal
containing the transgene of interest via any method known to those of skill in the art.

[0075] Other examples of cell or tissue-specific and constitutive promoters include but are not limited to smooth-muscle
SM22 promoter, including chimeric SM22alpha/telokin promoters (Hoggatt A.M. et al., 2002. Circ. Res. 91(12):1151-9);
ubiquitin C promoter (Biochim. Biophys. Acta. 2003, Jan. 3;1625(1):52-63); Hsf2 promoter; murine COMP (cartilage
oligomeric matrix protein) promoter; early B cell-specific mb-1 promoter (Sigvardsson M., et al., 2002. Mol. Cell Biol.
22(24):8539-51); prostate specific antigen (PSA) promoter (Yoshimura L. et al., 2002, J. Urol. 168(6):2659-64); exorh
promoter and pineal expression-promoting element (Asaoka Y., et al., 2002, Proc. Natl. Acad. Sci. 99(24):15458-61);
neural and liver ceramidase gene promoters (Okino M. et al., 2002. Biochem. Biophys. Res. Commun. 290(1):180-6);
PSP94 gene promoter/enhancer (Gabril M.Y. et al., 2002. Gene Ther. 9(23):1589-99); promoter of the human FAT/CD36
gene (Koriki G., et al., 2002. Biol. Pharm. Bull. 25(11):1476-8); VL30 promoter (Staplin W.R. et al., 2002. Blood October
24, 2002); IL-10 promoter (Brenner S., et al., 2002. J. Biol. Chem. December 18, 2002).

[0076] Examples of avian promoters include, but are not limited to, promoters controlling expression of egg white
proteins, such as ovalbumin, ovotransferrin (conalbumin), ovomucoid, lysozyme, ovomucin, g2 ovoglobulin, g3
ovoglobulin, ovoflavoprotein, ovostatin (ovomacroglobin), cystatin, avidin, thiamine-binding protein, glutamyl aminopeptidase,
minor glycoprotein 1, minor glycoprotein 2; and promoters controlling expression of egg-yolk proteins, such as vitellogenin,
very low-density lipoproteins, low density lipoprotein, cobalamin-binding protein, riboflavin-binding protein, biotin-binding
protein (Awade, 1996. Z. Lebensm. Unters. Forsch. 202:1-14). An advantage of using the vitellogenin promoter is that
it is active during the egg-laying stage of an animal's life-cycle, which allows for the production of the protein of interest
to be temporally connected to the import of the protein of interest into the egg yolk when the protein of interest is equipped
with an appropriate targeting sequence.

[0077] Liver-specific promoters of the present invention include, but are not limited to, the following promoters:
vitellogenin promoter, G6P promoter, cholesterol-7-alpha-hydroxylase (CYP7A) promoter, phenylalanine hydroxylase (PAH)
promoter, protein C gene promoter, insulin-like growth factor I (IGF-I) promoter, bilirubin UDP-glucuronosyltransferase
promoter, aldolase B promoter, furin promoter, metallothionein promoter, albumin promoter, and insulin promoter.

[0078] Also included in the present invention are promoters that can be used to target expression of a protein of interest
into the milk of a milk-producing animal including, but not limited to, β lactoglobin promoter, whey acidic protein promoter,
lactalbumin promoter and casein promoter.

[0079] Promoters associated with cells of the immune system may also be used. Acute phase promoters such as
interleukin (IL)-1 and IL-2 may be employed. Promoters for heavy and light chain Ig may also be employed. The promoters
of the T cell receptor components CD4 and CD8, B cell promoters and the promoters of CR2 (complement receptor type
2) may also be employed. Immune system promoters are preferably used when the desired protein is an antibody protein.

[0080] Also included in this invention are modified promoters/enhancers wherein elements of a single promoter are
duplicated, modified, or otherwise changed. In one embodiment, steroid hormone-binding domains of the ovalbumin
promoter are moved from about -8.5 kb to within approximately the first 1000 base pairs of the gene of interest. Modifying
an existing promoter with promoter/enhancer elements not found naturally in the promoter, as well as building an entirely
synthetic promoter, or drawing promoter/enhancer elements from various genes together on a non-natural backbone,
are all encompassed by the current invention.

[0081] Accordingly, it is to be understood that the promoters contained within the transposon-based vectors of the
present invention may be entire promoter sequences or fragments of promoter sequences. For example, in one
embodiment, the promoter operably linked to a gene of interest is an approximately 900 base pair fragment of a chicken
ovalbumin promoter (SEQ ID NO:40). The constitutive and inducible promoters contained within the transposon-based
vectors may also be modified by the addition of one or more modified Kozak sequences of ACCATG (SEQ ID NO:13).

[0082] As indicated above, the present invention includes transposon-based vectors containing one or more enhancers.
These enhancers may or may not be operably-linked to their native promoter and may be located at any distance from
their operably-linked promoter. A promoter operably-linked to an enhancer is referred to herein as an "enhanced
promoter." The enhancers contained within the transposon-based vectors are preferably enhancers found in birds, and
more preferably, an ovalbumin enhancer, but are not limited to these types of enhancers. In one embodiment, an
approximately 875 base pair enhancer element of an ovalbumin promoter is cloned upstream of an ovalbumin promoter
with 300 base pairs of spacer DNA separating the enhancer and promoter. In one embodiment, the enhancer used as
a part of the present invention comprises base pairs 1-675 of a Chicken Ovalbumin enhancer from GenBank accession
#S82527.1. The polynucleotide sequence of this enhancer is provided in SEQ ID NO:37.

[0083] Also included in some of the transposon-based vectors of the present invention are cap sites and fragments
of cap sites. In one embodiment, approximately 50 base pairs of a 5' untranslated region wherein the capsite resides
are added on the 3' end of an enhanced promoter or promoter. An exemplary 5' untranslated region is provided in SEQ
ID NO:38. A putative cap-site residing in this 5' untranslated region preferably comprises the polynucleotide sequence
provided in SEQ ID NO:39.

[0084] In one embodiment of the present invention, the first promoter operably-linked to the transposase gene is a
constitutive promoter and the second promoter operably-linked to the gene of interest is a tissue-specific promoter. In
this embodiment, use of the first constitutive promoter allows for constitutive activation of the transposase gene and
incorporation of the gene of interest into virtually all cell types, including the germline of the recipient animal. Although
the gene of interest is incorporated into the germline generally, the gene of interest is only expressed in a tissue-specific
manner. It should be noted that cell- or tissue-specific expression as described herein does not require a complete
absence of expression in cells or tissues other than the preferred cell or tissue. Instead, "cell-specific" or "tissue-specific"
expression refers to a majority of the expression of a particular gene of interest in the preferred cell or tissue, respectively.
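
The definition of tissue-specific expression given above can be stated as a simple calculation. The following Python sketch assumes that "a majority" means more than half of the total measured expression and that expression has been quantified per tissue in comparable units; both points are interpretive assumptions, and the function names and numbers are hypothetical.

```python
# Illustrative sketch only; the >50% threshold is an assumed reading of "majority".
def expression_share(expression_by_tissue, preferred):
    """Fraction of total measured expression found in the preferred tissue."""
    total = sum(expression_by_tissue.values())
    if total <= 0:
        raise ValueError("total expression must be positive")
    return expression_by_tissue.get(preferred, 0.0) / total


def is_tissue_specific(expression_by_tissue, preferred):
    """True if the preferred tissue accounts for a majority of expression."""
    return expression_share(expression_by_tissue, preferred) > 0.5


if __name__ == "__main__":
    measured = {"oviduct": 80.0, "liver": 15.0, "muscle": 5.0}  # made-up values
    print(expression_share(measured, "oviduct"), is_tissue_specific(measured, "oviduct"))
```

Under this reading, residual expression in other tissues does not prevent a promoter from being treated as tissue-specific.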

[0085] When incorporation of the gene of interest into the germline is not preferred, the first promoter operably-linked
to the transposase gene can be a tissue-specific promoter. For example, transfection of a transposon-based vector
containing a transposase gene operably-linked to a liver-specific promoter such as the G6P promoter or vitellogenin
promoter provides for activation of the transposase gene and incorporation of the gene of interest in the cells of the liver
but not into the germline and other cells generally. In this second embodiment, the second promoter operably-linked
to the gene of interest can be an ovalbumin promoter, a conalbumin promoter, a vitellogenin promoter or an ovomucoid
promoter. In embodiments wherein tissue-specific expression or incorporation is desired, it is preferred that the
transposon-based vector is administered directly to the tissue of interest or to an artery leading to the tissue of interest.

[0086] Accordingly, cell specific promoters may be used to enhance transcription in selected tissues. In birds, for
example, promoters that are found in cells of the fallopian tube, such as ovalbumin, conalbumin, ovomucoid and/or
lysozyme, are used in the vectors to ensure transcription of the gene of interest in the epithelial cells and tubular gland
cells of the fallopian tube, leading to synthesis of the desired protein encoded by the gene and deposition into the egg
white. In mammals, promoters specific for the epithelial cells of the alveoli of the mammary gland, such as prolactin,
insulin, beta lactoglobin, whey acidic protein, lactalbumin, casein, and/or placental lactogen, are used in the design of
vectors used for transfection of these cells for the production of desired proteins for deposition into the milk. In liver cells,
the G6P promoter may be employed to drive transcription of the gene of interest for protein production. Proteins made
in the liver of birds may be delivered to the egg yolk.

[0087] In order to achieve higher or more efficient expression of the transposase gene, the promoter and other
regulatory sequences operably-linked to the transposase gene may be those derived from the host. These host specific
regulatory sequences can be tissue specific as described above or can be of a constitutive nature. For example, an
avian actin promoter and its associated polyA sequence can be operably-linked to a transposase in a transposase-based
vector for transfection into an avian. Examples of other host specific promoters that could be operably-linked to
the transposase include the myosin and DNA or RNA polymerase promoters.

Directing Sequences

[0088] In some embodiments of the present invention, the gene of interest is operably-linked to a directing sequence
or a sequence that provides proper conformation to the desired protein encoded by the gene of interest. As used herein,
the term "directing sequence" refers to both signal sequences and targeting sequences. An egg directing sequence
includes, but is not limited to, an ovomucoid signal sequence, an ovalbumin signal sequence and a vitellogenin targeting
sequence. The term "signal sequence" refers to an amino acid sequence, or the polynucleotide sequence that encodes
the amino acid sequence, that directs the protein to which it is linked to the endoplasmic reticulum in a eukaryote, and
more preferably the translocational pores in the endoplasmic reticulum, or the plasma membrane in a prokaryote, or
mitochondria, such as for the purpose of gene therapy of mitochondrial diseases. Signal and targeting sequences can
be used to direct a desired protein into, for example, the milk, when the transposon-based vectors are administered to
a milk-producing animal.

[0089] Signal sequences can also be used to direct a desired protein into, for example, a secretory pathway for
incorporation into the egg yolk or the egg white, when the transposon-based vectors are administered to a bird or other
egg-laying animal. One example of such a transposon-based vector is provided in Figure 3 wherein the gene of interest
is operably linked to the ovomucoid signal sequence. The present invention also includes a gene of interest
operably-linked to a second gene containing a signal sequence. An example of such an embodiment is shown in Figure 2 wherein
the gene of interest is operably-linked to the ovalbumin gene that contains an ovalbumin signal sequence. Other signal
sequences that can be included in the transposon-based vectors include, but are not limited to, the ovotransferrin and
lysozyme signal sequences.

[0090] As also used herein, the term "targeting sequence" refers to an amino acid sequence, or the polynucleotide
sequence encoding the amino acid sequence, which amino acid sequence is recognized by a receptor located on the
exterior of a cell. Binding of the receptor to the targeting sequence results in uptake of the protein or peptide
operably-linked to the targeting sequence by the cell. One example of a targeting sequence is a vitellogenin targeting sequence
that is recognized by a vitellogenin receptor (or the low density lipoprotein receptor) on the exterior of an oocyte. In one
embodiment, the vitellogenin targeting sequence includes the polynucleotide sequence of SEQ ID NO:18. In another
embodiment, the vitellogenin targeting sequence includes all or part of the vitellogenin gene. Other targeting sequences
include VLDL and Apo E, which are also capable of binding the vitellogenin receptor. Since the ApoE protein is not
endogenously expressed in birds, its presence may be used advantageously to identify birds carrying the
transposon-based vectors of the present invention.

Genes of Interest Encoding Desired Proteins

[0091] A gene of interest selected for stable incorporation is designed to encode any desired protein or peptide or to
regulate any cellular response. In some embodiments, the desired proteins or peptides are deposited in an egg or in
milk. It is to be understood that the present invention encompasses transposon-based vectors containing multiple genes
of interest. The multiple genes of interest may each be operably-linked to a separate promoter and other regulatory
sequence(s) or may all be operably-linked to the same promoter and other regulatory sequence(s). In one embodiment,
multiple genes of interest are linked to a single promoter and other regulatory sequence(s) and each gene of interest is
separated by a cleavage site or a pro portion of a signal sequence.

[0092] Protein and peptide hormones are a preferred class of proteins in the present invention. Such protein and
peptide hormones are synthesized throughout the endocrine system and include, but are not limited to, hypothalamic
hormones and hypophysiotropic hormones, anterior, intermediate and posterior pituitary hormones, pancreatic islet
hormones, hormones made in the gastrointestinal system, renal hormones, thymic hormones, parathyroid hormones,
adrenal cortical and medullary hormones. Specifically, hormones that can be produced using the present invention
include, but are not limited to, chorionic gonadotropin, corticotropin, erythropoietin, glucagons, IGF-1, oxytocin,
platelet-derived growth factor, calcitonin, follicle-stimulating hormone, luteinizing hormone, thyroid-stimulating hormone, insulin,
gonadotropin-releasing hormone and its analogs, vasopressin, octreotide, somatostatin, prolactin, adrenocorticotropic
hormone, antidiuretic hormone, thyrotropin-releasing hormone (TRH), growth hormone-releasing hormone (GHRH),
dopamine, melatonin, thyroxin (T4), parathyroid hormone (PTH), glucocorticoids such as cortisol, mineralocorticoids
such as aldosterone, androgens such as testosterone, adrenaline (epinephrine), noradrenaline (norepinephrine),
estrogens such as estradiol, progesterone, glucagons, calcitriol, calciferol, atrial-natriuretic peptide, gastrin, secretin,
cholecystokinin (CCK), neuropeptide Y, ghrelin, PYY3-36, angiotensinogen, thrombopoietin, and leptin. By using appropriate
polynucleotide sequences, species-specific hormones may be made by transgenic animals.

[0093] In one embodiment of the present invention, the gene of interest is a proinsulin gene and the desired molecule
is insulin. Proinsulin consists of three parts: a C-peptide and two long strands of amino acids (called the alpha and beta
chains) that later become linked together to form the insulin molecule. Figures 2 and 3 are schematics of
transposon-based vector constructs containing a proinsulin gene operably-linked to an ovalbumin promoter and ovalbumin protein
or an ovomucoid promoter and ovomucoid signal sequence, respectively. In these embodiments, proinsulin is expressed
in the oviduct tubular gland cells and then deposited in the egg white. One example of a proinsulin polynucleotide
sequence is shown in SEQ ID NO:21, wherein the C peptide cleavage site spans from Arg at position 31 to Arg at
position 65.
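
The positions named above can be illustrated by splitting a proinsulin amino acid sequence at residues 31 and 65. The Python sketch below uses the widely published human proinsulin sequence (86 residues) as a stand-in; this is an assumption, since the sequence of SEQ ID NO:21 is not reproduced here and may differ. Positions are 1-based, and the function name is hypothetical.

```python
# Illustrative sketch only; the human sequence is an assumed stand-in for SEQ ID NO:21.
HUMAN_PROINSULIN = (
    "FVNQHLCGSHLVEALYLVCGERGFFYTPKT"      # B (beta) chain, residues 1-30
    "RR"                                  # dibasic site, residues 31-32
    "EAEDLQVGQVELGGGPGAGSLQPLALEGSLQ"     # C-peptide, residues 33-63
    "KR"                                  # dibasic site, residues 64-65
    "GIVEQCCTSICSLYQLENYCN"               # A (alpha) chain, residues 66-86
)


def split_proinsulin(seq, c_start=31, c_end=65):
    """Split proinsulin into (B chain, excised C region, A chain).

    c_start and c_end are 1-based, inclusive positions bounding the region
    that is removed when proinsulin is processed to insulin.
    """
    b_chain = seq[:c_start - 1]
    c_region = seq[c_start - 1:c_end]
    a_chain = seq[c_end:]
    return b_chain, c_region, a_chain


if __name__ == "__main__":
    b_chain, c_region, a_chain = split_proinsulin(HUMAN_PROINSULIN)
    print(len(b_chain), len(c_region), len(a_chain))
    print(c_region[0], c_region[-1])  # residues at positions 31 and 65
```

In the human sequence both boundary residues are arginine, which is consistent with the Arg-to-Arg span described in the paragraph.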

[0094] Serum proteins including lipoproteins such as high density lipoprotein (HDL), HDL-Milano and low density
lipoprotein, albumin, clotting cascade factors, factor VIII, factor IX, fibrinogen, and globulins are also included in the
group of desired proteins of the present invention. Immunoglobulins are one class of desired globulin molecules and
include but are not limited to IgG, IgM, IgA, IgD, IgE, IgY, lambda chains, kappa chains and fragments thereof; Fc
fragments, and Fab fragments. Desired antibodies include, but are not limited to, naturally occurring antibodies, human
antibodies, humanized antibodies, and hybrid antibodies. Genes encoding modified versions of naturally occurring
antibodies or fragments thereof and genes encoding artificially designed antibodies or fragments thereof may be
incorporated into the transposon-based vectors of the present invention. Desired antibodies also include antibodies with the
ability to bind specific ligands, for example, antibodies against proteins associated with cancer-related molecules, such
as anti-her 2, or anti-CA125. Accordingly, the present invention encompasses a transposon-based vector containing
one or more genes encoding a heavy immunoglobulin (Ig) chain and a light Ig chain. Further, more than one gene
encoding for more than one antibody may be administered in one or more transposon-based vectors of the present
invention. In this manner, an egg may contain more than one type of antibody in the egg white, the egg yolk or both.

[0095] In one embodiment, a transposon-based vector contains a heavy Ig chain and a light Ig chain, both operably
linked to a promoter. Figures 5 and 6 schematically depict exemplary constructs of this embodiment. More specifically,
Figure 5 shows a construct containing a cecropin pre-pro sequence and a cecropin pro sequence, wherein the pre
sequence functions to direct the resultant protein into the endoplasmic reticulum and the pro sequences are cleaved
upon secretion of the protein from a cell into which the construct has been transfected. Figure
6 shows a construct containing an enterokinase cleavage site. In this embodiment, it may be required to further remove
several additional amino acids from the light chain following cleavage by enterokinase. In another embodiment, the
transposon-based vector comprises a heavy Ig chain operably-linked to one promoter and a light Ig chain
operably-linked to another promoter. Figure 7 schematically depicts an exemplary construct of this embodiment. The present
invention also encompasses a transposon-based vector containing genes encoding portions of a heavy Ig chain and/or
portions of a light Ig chain. The present invention further includes a transposon-based vector containing a gene that
encodes a fusion protein comprising a heavy and/or light Ig chain, or portions thereof.

[0096] Antibodies used as therapeutic reagents include but are not limited to antibodies for use in cancer
immunotherapy against specific antigens, or for providing passive immunity to an animal or a human against an infectious disease
or a toxic agent. Antibodies used as diagnostic reagents include, but are not limited to antibodies that may be labeled
and detected with a detector, for example antibodies with a fluorescent label attached that may be detected following
exposure to specific wavelengths. Such labeled antibodies may be primary antibodies directed to a specific antigen, for
example, rhodamine-labeled rabbit anti-growth hormone, or may be labeled secondary antibodies, such as
fluorescein-labeled goat-anti chicken IgG. Such labeled antibodies are known to one of ordinary skill in the art. Labels useful for
attachment to antibodies are also known to one of ordinary skill in the art. Some of these labels are described in the
"Handbook of Fluorescent Probes and Research Products", ninth edition, Richard P. Haugland (ed.) (Molecular Probes,
Inc., Eugene, OR), which is incorporated herein in its entirety.

[0097] Antibodies produced using the present invention may be used as laboratory reagents for numerous
applications including radioimmunoassay, western blots, dot blots, ELISA, immunoaffinity columns and other procedures
requiring antibodies as known to one of ordinary skill in the art. Such antibodies include primary antibodies, secondary
antibodies and tertiary antibodies, which may be labeled or unlabeled.

[0098] Antibodies that may be made with the practice of the present invention include, but are not limited to primary
antibodies, secondary antibodies, designer antibodies, anti-protein antibodies, anti-peptide antibodies, anti-DNA
antibodies, anti-RNA antibodies, anti-hormone antibodies, anti-hypophysiotropic peptides, antibodies against non-natural
antigens, anti-anterior pituitary hormone antibodies, anti-posterior pituitary hormone antibodies, anti-venom antibodies,
anti-tumor marker antibodies, antibodies directed against epitopes associated with infectious disease, including, anti-viral,
anti-bacterial, anti-protozoal, anti-fungal, anti-parasitic, anti-receptor, anti-lipid, anti-phospholipid, anti-growth factor,
anti-cytokine, anti-monokine, anti-idiotype, and anti-accessory (presentation) protein antibodies. Antibodies made with
the present invention, as well as light chains or heavy chains, may also be used to inhibit enzyme activity.

[0099] Antibodies that may be produced using the present invention include, but are not limited to, antibodies made
against the following proteins: Bovine γ-Globulin, Serum; Bovine IgG, Plasma; Chicken γ-Globulin, Serum; Human
γ-Globulin, Serum; Human IgA, Plasma; Human IgA1, Myeloma; Human IgA2, Myeloma; Human IgA2, Plasma; Human
IgD, Plasma; Human IgE, Myeloma; Human IgG, Plasma; Human IgG, Fab Fragment, Plasma; Human IgG, F(ab')2
Fragment, Plasma; Human IgG, Fc Fragment, Plasma; Human IgG1, Myeloma; Human IgG2, Myeloma; Human IgG3,
Myeloma; Human IgG4, Myeloma; Human IgM, Myeloma; Human IgM, Plasma; Human Immunoglobulin, Light Chain κ,
Urine; Human Immunoglobulin, Light Chains κ and λ, Plasma; Mouse γ-Globulin, Serum; Mouse IgG, Serum; Mouse
IgM, Myeloma; Rabbit γ-Globulin, Serum; Rabbit IgG, Plasma; and Rat γ-Globulin, Serum. In one embodiment, the
transposon-based vector comprises the coding sequence of light and heavy chains of a murine monoclonal antibody
that shows specificity for human seminoprotein (GenBank Accession numbers AY129006 and AY129304 for the light
and heavy chains, respectively).

[0100] A further non-limiting list of antibodies that recognize other antibodies is as follows: Anti-Chicken IgG, heavy
(H) & light (L) Chain Specific (Sheep); Anti-Goat γ-Globulin (Donkey); Anti-Goat IgG, Fc Fragment Specific (Rabbit);
Anti-Guinea Pig γ-Globulin (Goat); Anti-Human Ig, Light Chain, Type κ Specific; Anti-Human Ig, Light Chain, Type λ
Specific; Anti-Human IgA, α-Chain Specific (Goat); Anti-Human IgA, Fab Fragment Specific; Anti-Human IgA, Fc
Fragment Specific; Anti-Human IgA, Secretory; Anti-Human IgE, ε-Chain Specific (Goat); Anti-Human IgE, Fc Fragment
Specific; Anti-Human IgG, Fc Fragment Specific (Goat); Anti-Human IgG, γ-Chain Specific (Goat); Anti-Human IgG, Fc
Fragment Specific; Anti-Human IgG, Fd Fragment Specific; Anti-Human IgG, H & L Chain Specific (Goat); Anti-Human
IgG1, Fc Fragment Specific; Anti-Human IgG2, Fc Fragment Specific; Anti-Human IgG2, Fd Fragment Specific;
Anti-Human IgG3, Hinge Specific; Anti-Human IgG4, Fc Fragment Specific; Anti-Human IgM, Fc Fragment Specific;
Anti-Human IgM, μ-Chain Specific; Anti-Mouse IgE, ε-Chain Specific; Anti-Mouse γ-Globulin (Goat); Anti-Mouse IgG, γ-Chain
Specific (Goat); Anti-Mouse IgG, γ-Chain Specific (Goat) F(ab')2 Fragment; Anti-Mouse IgG, H & L Chain Specific (Goat);
Anti-Mouse IgM, μ-Chain Specific (Goat); Anti-Mouse IgM, H & L Chain Specific (Goat); Anti-Rabbit γ-Globulin (Goat);
Anti-Rabbit IgG, Fc Fragment Specific (Goat); Anti-Rabbit IgG, H & L Chain Specific (Goat); Anti-Rat γ-Globulin (Goat);
Anti-Rat IgG, H & L Chain Specific; Anti-Rhesus Monkey γ-Globulin (Goat); and, Anti-Sheep IgG, H & L Chain Specific.

[0101] Another non-limiting list of the antibodies that may be produced using the present invention is provided in
product catalogs of companies such as Phoenix Pharmaceuticals, Inc. (www.phoenixpeptide.com; 530 Harbor Boulevard,
Belmont, CA), Peninsula Labs San Carlos CA, SIGMA, St. Louis, MO www.sigma-aldrich.com, Cappel ICN, Irvine,
California, www.icnbiomed.com, and Calbiochem, La Jolla, California, www.calbiochem.com, which are all incorporated
herein by reference in their entirety. The polynucleotide sequences encoding these antibodies may be obtained from
the scientific literature, from patents, and from databases such as GenBank. Alternatively, one of ordinary skill in the art
may design the polynucleotide sequence to be incorporated into the genome by choosing the codons that encode for
each amino acid in the desired antibody. Antibodies made by the transgenic animals of the present invention include
antibodies that may be used as therapeutic reagents, for example in cancer immunotherapy against specific antigens,
as diagnostic reagents and as laboratory reagents for numerous applications including immunoneutralization,
radioimmunoassay, western blots, dot blots, ELISA, immunoprecipitation and immunoaffinity columns. Some of these antibodies
include, but are not limited to, antibodies which bind the following ligands: adrenomedullin, amylin, calcitonin, amyloid,
calcitonin gene-related peptide, cholecystokinin, gastrin, gastric inhibitory peptide, gastrin releasing peptide, interleukin,
interferon, cortistatin, somatostatin, endothelin, sarafotoxin, glucagon, glucagon-like peptide, insulin, atrial natriuretic
peptide, BNP, CNP, neurokinin, substance P, leptin, neuropeptide Y, melanin concentrating hormone, melanocyte
stimulating hormone, orphanin, endorphin, dynorphin, enkephalin, leumorphin, peptide F, PACAP,
PACAP-related peptide, parathyroid hormone, urocortin, corticotrophin releasing hormone, PHM, PHI, vasoactive intestinal
polypeptide, secretin, ACTH, angiotensin, angiostatin, bombesin, endostatin, bradykinin, FMRF amide, galanin,
gonadotropin releasing hormone (GnRH) associated peptide, GnRH, growth hormone releasing hormone, inhibin,
granulocyte-macrophage colony stimulating factor (GM-CSF), motilin, neurotensin, oxytocin, vasopressin, osteocalcin, pancreastatin,
pancreatic polypeptide, peptide YY, proopiomelanocortin, transforming growth factor, vascular endothelial growth factor,
vesicular monoamine transporter, vesicular acetylcholine transporter, ghrelin, NPW, NPB, C3d, prokineticin, thyroid
stimulating hormone, luteinizing hormone, follicle stimulating hormone, prolactin, growth hormone, beta-lipotropin,
melatonin, kallikreins, kinins, prostaglandins, erythropoietin, p146 (SEQ ID NO:18, amino acid sequence, SEQ ID NO:19,
nucleotide sequence), estrogen, testosterone, corticosteroids, mineralocorticoids, thyroid hormone, thymic hormones,
connective tissue proteins, nuclear proteins, actin, avidin, activin, agrin, albumin, and prohormones, propeptides, splice
variants, fragments and analogs thereof.

[0102] The following is yet another non-limiting list of antibodies that can be produced by the methods of the present invention:
abciximab (ReoPro), abciximab anti-platelet aggregation monoclonal antibody, anti-CD11a (hu1124), anti-CD18
antibody, anti-CD20 antibody, anti-cytomegalovirus (CMV) antibody, anti-digoxin antibody, anti-hepatitis B antibody,
anti-HER-2 antibody, anti-idiotype antibody to GD3 glycolipid, anti-IgE antibody, anti-IL-2R antibody, antimetastatic cancer
antibody (mAb 17-1A), anti-rabies antibody, anti-respiratory syncytial virus (RSV) antibody, anti-Rh antibody, anti-TCR,
anti-TNF antibody, anti-VEGF antibody and fab fragment thereof, rattlesnake venom antibody, black widow spider venom
antibody, coral snake venom antibody, antibody against very late antigen-4 (VLA-4), C225 humanized antibody to EGF
receptor, chimeric (human & mouse) antibody against TNFα, antibody directed against GPIIb/IIIa receptor on human
platelets, gamma globulin, anti-hepatitis B immunoglobulin, human anti-D immunoglobulin, human antibodies against
S. aureus, human tetanus immunoglobulin, humanized antibody against the epidermal growth receptor-2, humanized
antibody against the α subunit of the interleukin-2 receptor, humanized antibody CTLA4IG, humanized antibody to the
IL-2 R α-chain, humanized anti-CD40-ligand monoclonal antibody (5c8), humanized mAb against the epidermal growth
receptor-2, humanized mAb to rous sarcoma virus, humanized recombinant antibody (IgG1k) against respiratory syncytial
virus (RSV), lymphocyte immunoglobulin (anti-thymocyte antibody), lymphocyte immunoglobulin, mAb against factor
VII, MDX-210 bi-specific antibody against HER-2, MDX-22, MDX-220 bi-specific antibody against TAG-72 on tumors,
MDX-33 antibody to FcγR1 receptor, MDX-447 bi-specific antibody against EGF receptor, MDX-447 bispecific humanized
antibody to EGF receptor, MDX-RA immunotoxin (ricin A linked) antibody, Medi-507 antibody (humanized form of
BTI-322) against CD2 receptor on T-cells, monoclonal antibody LDP-02, muromonab-CD3 (OKT3) antibody, OKT3
("muromonab-CD3") antibody, PRO 542 antibody, ReoPro ("abciximab") antibody, and TNF-IgG fusion protein.

[0103] The antibodies prepared using the methods of the present invention may also be designed to possess specific
labels that may be detected through means known to one of ordinary skill in the art. The antibodies may also be designed
to possess specific sequences useful for purification through means known to one of ordinary skill in the art. Specialty
antibodies designed for binding specific antigens may also be made in transgenic animals using the transposon-based
vectors of the present invention.

[0104] Production of a monoclonal antibody using the transposon-based vectors of the present invention can be
accomplished in a variety of ways. In one embodiment, two vectors may be constructed: one that encodes the light
chain, and a second vector that encodes the heavy chain of the monoclonal antibody. These vectors may then be
incorporated into the genome of the target animal by methods disclosed herein. In an alternative embodiment, the
sequences encoding light and heavy chains of a monoclonal antibody may be included on a single DNA construct. For
example, the coding sequence of light and heavy chains of a murine monoclonal antibody that shows specificity for human
seminoprotein can be expressed using transposon-based constructs of the present invention (GenBank Accession
numbers AY129006 and AY129304 for the light and heavy chains, respectively).

[0105] Further included in the present invention are proteins and peptides synthesized by the immune system including
those synthesized by the thymus, lymph nodes, spleen, and the gastrointestinal associated lymph tissues (GALT) system.
The immune system proteins and peptides that can be made in transgenic animals using the transposon-based
vectors of the present invention include, but are not limited to, alpha-interferon, beta-interferon, gamma-interferon,
alpha-interferon A, alpha-interferon 1, G-CSF, GM-CSF, interleukin-1 (IL-1), IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10,
IL-11, IL-12, IL-13, TNF-α, and TNF-β. Other cytokines included in the present invention include cardiotrophin, stromal cell
derived factor, macrophage derived chemokine (MDC), melanoma growth stimulatory activity (MGSA), macrophage
inflammatory proteins 1 alpha (MIP-1 alpha), 2, 3 alpha, 3 beta, 4 and 5.

[0106] Lytic peptides such as p146 are also included in the desired molecules of the present invention. In one
embodiment, the p146 peptide comprises an amino acid sequence of SEQ ID NO:19. The present invention also
encompasses a transposon-based vector comprising a p146 nucleic acid comprising a polynucleotide sequence of SEQ ID
NO:20.

[0107] Enzymes are another class of proteins that may be made through the use of the transposon-based vectors of
the present invention. Such enzymes include but are not limited to adenosine deaminase, alpha-galactosidase, cellulase,
collagenase, DNase I, hyaluronidase, lactase, L-asparaginase, pancreatin, papain, streptokinase B, subtilisin, superoxide
dismutase, thrombin, trypsin, urokinase, fibrinolysin, glucocerebrosidase and plasminogen activator. In some
embodiments wherein the enzyme could have deleterious effects, additional amino acids and a protease cleavage site are
added to the carboxy end of the enzyme of interest in order to prevent expression of a functional enzyme. Subsequent
digestion of the enzyme with a protease results in activation of the enzyme.

[0108] Extracellular matrix proteins are one class of desired proteins that may be made through the use of the present
invention. Examples include but are not limited to collagen, fibrin, elastin, laminin, and fibronectin and subtypes thereof.
Intracellular proteins and structural proteins are other classes of desired proteins in the present invention.

[0109] Growth factors are another desired class of proteins that may be made through the use of the present invention
and include, but are not limited to, transforming growth factor-α ("TGF-α"), transforming growth factor-β (TGF-β),
platelet-derived growth factors (PDGF), fibroblast growth factors (FGF), including FGF acidic isoforms 1 and 2, FGF basic form
2 and FGF 4, 8, 9 and 10, nerve growth factors (NGF) including NGF 2.5s, NGF 7.0s and beta NGF and neurotrophins,
brain derived neurotrophic factor, cartilage derived factor, growth factors for stimulation of the production of red blood
cells, growth factors for stimulation of the production of white blood cells, bone growth factors (BGF), basic fibroblast
growth factor, vascular endothelial growth factor (VEGF), granulocyte colony stimulating factor (G-CSF), insulin like
growth factor (IGF) I and II, hepatocyte growth factor, glial neurotrophic growth factor (GDNF), stem cell factor (SCF),
keratinocyte growth factor (KGF), transforming growth factors (TGF), including TGFs alpha, beta, beta1, beta2, beta3,
skeletal growth factor, bone matrix derived growth factors, bone derived growth factors, erythropoietin (EPO) and mixtures
thereof.

[0110] Another desired class of proteins that may be made through the use of the present invention
includes but is not limited to leptin, leukemia inhibitory factor (LIF), tumor necrosis factor alpha and beta, ENBREL,
angiostatin, endostatin, thrombospondin, osteogenic protein-1, bone morphogenetic proteins 2 and 7, osteonectin,
somatomedin-like peptide, and osteocalcin.

[0111] A non-limiting list of the peptides and proteins that may be made through the use of the present
invention is provided in product catalogs of companies such as Phoenix Pharmaceuticals, Inc. (www.phoenixpeptide.com;
530 Harbor Boulevard, Belmont, CA), Peninsula Labs San Carlos CA, SIGMA, St. Louis, MO www.sigma-aldrich.com,
Cappel ICN, Irvine, California, www.icnbiomed.com, and Calbiochem, La Jolla, California, www.calbiochem.com. The
polynucleotide sequences encoding these proteins and peptides of interest may be obtained from the scientific literature,
from patents, and from databases such as GenBank. Alternatively, one of ordinary skill in the art may design the
polynucleotide sequence to be incorporated into the genome by choosing the codons that encode for each amino acid in
the desired protein or peptide.
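
As a minimal sketch of what "choosing the codons that encode for each amino acid" can look like in practice, the Python fragment below reverse-translates a one-letter protein sequence into one possible DNA coding sequence using the standard genetic code. The function names (`reverse_translate`, `translate`) and the rule of taking the first listed codon for each amino acid are illustrative assumptions, not part of the disclosure; an actual design would normally weigh host codon usage and other constraints that are not modeled here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption for illustration only: use the first codon listed for each
# amino acid. This is an arbitrary choice, not a host-optimized one.
CHOSEN_CODON = {}
for codon, aa in CODON_TABLE.items():
    CHOSEN_CODON.setdefault(aa, codon)


def reverse_translate(protein: str) -> str:
    """Return one DNA sequence that encodes the given one-letter protein sequence."""
    try:
        return "".join(CHOSEN_CODON[aa] for aa in protein.upper())
    except KeyError as err:
        raise ValueError(f"unknown amino acid code: {err.args[0]}") from None


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (length divisible by 3) back to protein."""
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    peptide = "MKWVTFISLL"  # arbitrary example sequence, not from the patent
    coding = reverse_translate(peptide)
    print(coding)
    print(translate(coding) == peptide)
```

Translating the designed sequence back to protein is a simple consistency check: many different nucleotide sequences encode the same protein, so the round trip confirms the encoding without implying that the chosen codons are the preferred ones.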

[0112] Some of these desired proteins or peptides that may be made through the use of the present invention include
but are not limited to the following: adrenomedullin, amylin, calcitonin, amyloid, calcitonin gene-related peptide,
cholecystokinin, gastrin, gastric inhibitory peptide, gastrin releasing peptide, interleukin, interferon, cortistatin, somatostatin,
endothelin, sarafotoxin, glucagon, glucagon-like peptide, insulin, atrial natriuretic peptide, BNP, CNP, neurokinin,
substance P, leptin, neuropeptide Y, melanin concentrating hormone, melanocyte stimulating hormone, orphanin, endorphin,
dynorphin, enkephalin, leumorphin, peptide F, PACAP, PACAP-related peptide, parathyroid hormone, urocortin,
corticotrophin releasing hormone, PHM, PHI, vasoactive intestinal polypeptide, secretin, ACTH, angiotensin, angiostatin,
bombesin, endostatin, bradykinin, FMRF amide, galanin, gonadotropin releasing hormone (GnRH) associated peptide,
GnRH, growth hormone releasing hormone, inhibin, granulocyte-macrophage colony stimulating factor (GM-CSF),
motilin, neurotensin, oxytocin, vasopressin, osteocalcin, pancreastatin, pancreatic polypeptide, peptide YY,
proopiomelanocortin, transforming growth factor, vascular endothelial growth factor, vesicular monoamine transporter, vesicular
acetylcholine transporter, ghrelin, NPW, NPB, C3d, prokineticin, thyroid stimulating hormone, luteinizing hormone, follicle
stimulating hormone, prolactin, growth hormone, beta-lipotropin, melatonin, kallikreins, kinins, prostaglandins,
erythropoietin, p146 (SEQ ID NO:19, amino acid sequence, SEQ ID NO:20, nucleotide sequence), thymic hormones, connective
tissue proteins, nuclear proteins, actin, avidin, activin, agrin, albumin, and prohormones, propeptides, splice variants,
fragments and analogs thereof.

[0113] Other desired proteins that may be made by the transgenic animals of the present invention include bacitracin,
polymixin b, vancomycin, cyclosporins, anti-RSV antibody, alpha-1 antitrypsin (AAT), anti-cytomegalovirus antibody,
anti-hepatitis antibody, anti-inhibitor coagulant complex, anti-rabies antibody, anti-Rh(D) antibody, adenosine
deaminase, anti-digoxin antibody, antivenin crotalidae (rattlesnake venom antibody), antivenin latrodectus (black widow spider
venom antibody), antivenin micrurus (coral snake venom antibody), aprotinin, corticotropin (ACTH), diphtheria antitoxin,
lymphocyte immune globulin (anti-thymocyte antibody), protamine, thyrotropin, capreomycin, α-galactosidase,
gramicidin, streptokinase, tetanus toxoid, tyrothricin, IGF-1, proteins of varicella vaccine, anti-TNF antibody, anti-IL-2r antibody,
anti-HER-2 antibody, OKT3 ("muromonab-CD3") antibody, TNF-IgG fusion protein, ReoPro ("abciximab") antibody,
ACTH fragment 1-24, desmopressin, gonadotropin-releasing hormone, histrelin, leuprolide, lypressin, nafarelin, peptide
that binds GPIIb/GPIIIa on platelets (integrilin), goserelin, capreomycin, colistin, anti-respiratory syncytial virus,
lymphocyte immune globulin (Thymoglovin, Atgam), panorex, alpha-antitrypsin, botulinin, lung surfactant protein, tumor
necrosis receptor-IgG fusion protein (enbrel), gonadorelin, proteins of influenza vaccine, proteins of rotavirus vaccine,
proteins of haemophilus b conjugate vaccine, proteins of poliovirus vaccine, proteins of pneumococcal conjugate vaccine,
proteins of meningococcal C vaccine, proteins of influenza vaccine, megakaryocyte growth and development factor
(MGDF), neuroimmunophilin ligand-A (NIL-A), brain-derived neurotrophic factor (BDNF), glial cell line-derived
neurotrophic factor (GDNF), leptin (native), leptin B, leptin C, IL-1RA (interleukin-1RA), R-568, novel erythropoiesis-stimulating
protein (NESP), humanized mAb to rous sarcoma virus (MEDI-493), glutamyl-tryptophan dipeptide IM862, LFA-3TIP
immunosuppressive, humanized anti-CD40-ligand monoclonal antibody (5c8), gelsonin enzyme, tissue factor pathway
inhibitor (TFPI), proteins of meningitis B vaccine, antimetastatic cancer antibody (mAb 17-1A), chimeric (human & mouse)
mAb against TNFα, mAb against factor VII, relaxin, capreomycin, glycopeptide (LY333328), recombinant human
activated protein C (rhAPC), humanized mAb against the epidermal growth receptor-2, alteplase, anti-CD20 antigen, C2B8
antibody, insulin-like growth factor-1, atrial natriuretic peptide (anaritide), tenecteplase, anti-CD11a antibody (hu1124),
anti-CD18 antibody, mAb LDP-02, anti-VEGF antibody, fab fragment of anti-VEGF Ab, APO2 ligand (tumor necrosis
factor-related apoptosis-inducing ligand), rTGF-β (transforming growth factor-β), alpha-antitrypsin, ananain (a pineapple
enzyme), humanized mAb CTLA4IG, PRO 542 (mAb), D2E7 (mAb), calf intestine alkaline phosphatase, α-L-iduronidase,
α-L-galactosidase (human), glutamic acid decarboxylase, acid sphingomyelinase, bone morphogenetic protein-2
(rhBMP-2), proteins of HIV vaccine, T cell receptor (TCR) peptide vaccine, TCR peptides, V beta 3 and V beta 13.1 (IR502),
(IR501), BI 1050/1272 mAb against very late antigen-4 (VLA-4), C225 humanized mAb to EGF receptor, anti-idiotype
antibody to GD3 glycolipid, antibacterial peptide against H. pylori, MDX-447 bispecific humanized mAb to EGF receptor,
anti-cytomegalovirus (CMV), Medi-491 B19 parvovirus vaccine, humanized recombinant mAb (IgG1k) against respiratory
syncytial virus (RSV), urinary tract infection vaccine (against "pili" on Escherichia coli strains), proteins of lyme disease
vaccine against B. burgdorferi protein (DbpA), proteins of Medi-501 human papilloma virus-11 vaccine (HPV),
Streptococcus pneumoniae vaccine, Medi-507 mAb (humanized form of BTI-322) against CD2 receptor on T-cells, MDX-33
mAb to FcγR1 receptor, MDX-RA immunotoxin (ricin A linked) mAb, MDX-210 bi-specific mAb against HER-2,
MDX-447 bi-specific mAb against EGF receptor, MDX-22, MDX-220 bi-specific mAb against TAG-72 on tumors,
colony-stimulating factor (CSF) (molgramostim), humanized mAb to the IL-2 R α-chain (basiliximab), mAb to IgE (IGE 025A),
myelin basic protein-altered peptide (MSP771A), humanized mAb against the epidermal growth receptor-2, humanized
mAb against the α subunit of the interleukin-2 receptor, low molecular weight heparin, anti-hemophilic factor, and
bactericidal/permeability-increasing protein (r-BPI).

[0114] The peptides and proteins made using the present invention may be labeled using labels and techniques known
to one of ordinary skill in the art. Some of these labels are described in the "Handbook of Fluorescent Probes and
Research Products", ninth edition, Richard P. Haugland (ed.) (Molecular Probes, Inc., Eugene, OR), which is incorporated
herein in its entirety. Some of these labels may be genetically engineered into the polynucleotide sequence for the
expression of the selected protein or peptide. The peptides and proteins may also have label-incorporation "handles"
incorporated to allow labeling of an otherwise difficult or impossible to label protein.

[0115] It is to be understood that the various classes of desired peptides and proteins, as well as specific peptides
and proteins described in this section may be modified as described below by inserting selected codons for desired
amino acid substitutions into the gene incorporated into the transgenic animal.

[0116] The present invention may also be used to produce desired molecules other than proteins and peptides including,
but not limited to, lipoproteins such as high density lipoprotein (HDL), HDL-Milano, and low density lipoprotein, lipids,
carbohydrates, siRNA and ribozymes. In these embodiments, a gene of interest encodes a nucleic acid molecule or a
protein that directs production of the desired molecule.

[0117] The present invention further encompasses the use of inhibitory molecules to inhibit endogenous (i.e.,
non-vector) protein production. These inhibitory molecules include antisense nucleic acids, siRNA and inhibitory proteins. In
one embodiment, a transposon-based vector containing an ovalbumin DNA sequence, that upon transcription forms a
double stranded RNA molecule, is transfected into an animal such as a bird and the bird's production of endogenous
ovalbumin protein is reduced by the interference RNA mechanism (RNAi). Additionally, inducible knockouts or
knockdowns of the endogenous protein may be created to achieve a reduction or inhibition of endogenous protein production.

Modified Desired Proteins and Peptides

[0118] "Proteins", "peptides," "polypeptides" and "oligopeptides" are chains of amino acids (typically L-amino acids)
whose alpha carbons are linked through peptide bonds formed by a condensation reaction between the carboxyl group
of the alpha carbon of one amino acid and the amino group of the alpha carbon of another amino acid. The terminal
amino acid at one end of the chain (i.e., the amino terminal) has a free amino group, while the terminal amino acid at
the other end of the chain (i.e., the carboxy terminal) has a free carboxyl group. As such, the term "amino terminus"
(abbreviated N-terminus) refers to the free alpha-amino group on the amino acid at the amino terminal of the protein,
or to the alpha-amino group (imino group when participating in a peptide bond) of an amino acid at any other location
within the protein. Similarly, the term "carboxy terminus" (abbreviated C-terminus) refers to the free carboxyl group on
the amino acid at the carboxy terminus of a protein, or to the carboxyl group of an amino acid at any other location within
the protein.

[0119] Typically, the amino acids making up a protein are numbered in order, starting at the amino terminal and
increasing in the direction toward the carboxy terminal of the protein. Thus, when one amino acid is said to "follow"
another, that amino acid is positioned closer to the carboxy terminal of the protein than the preceding amino acid.
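
The numbering convention can be illustrated with a short Python sketch, assuming a protein written as a one-letter string from amino terminal to carboxy terminal. The helper names `residue_at` and `follows` are hypothetical and serve only to make the 1-based convention explicit.

```python
def residue_at(protein: str, position: int) -> str:
    """Return the residue at a 1-based position counted from the amino terminal."""
    if not 1 <= position <= len(protein):
        raise ValueError(f"position {position} is outside 1..{len(protein)}")
    # Python strings are 0-based, so position 1 maps to index 0.
    return protein[position - 1]


def follows(position_a: int, position_b: int) -> bool:
    """True if the residue at position_a lies closer to the carboxy terminal
    than the residue at position_b, i.e. it 'follows' that residue."""
    return position_a > position_b


if __name__ == "__main__":
    peptide = "MKTAY"  # arbitrary example; M is residue 1, Y is residue 5
    print(residue_at(peptide, 1), residue_at(peptide, 5))
    print(follows(3, 2))
```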

[0120] The term "residue" is used herein to refer to an amino acid (D or L) or an amino acid mimetic that is incorporated
into a protein by an amide bond. As such, the amino acid may be a naturally occurring amino acid or, unless otherwise
limited, may encompass known analogs of natural amino acids that function in a manner similar to the naturally occurring
amino acids (i.e., amino acid mimetics). Moreover, an amide bond mimetic includes peptide backbone modifications
well known to those skilled in the art.

[0121] Furthermore, one of skill will recognize that, as mentioned above, individual substitutions, deletions or additions
which alter, add or delete a single amino acid or a small percentage of amino acids (typically less than about 5%, more
typically less than about 1%) in an encoded sequence are conservatively modified variations where the alterations result
in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing
functionally similar amino acids are well known in the art. The following six groups each contain amino acids that are
conservative substitutions for one another:

1) Alanine (A), Serine (S), Threonine (T);

2) Aspartic acid (D), Glutamic acid (E);

3) Asparagine (N), Glutamine (Q);

4) Arginine (R), Lysine (K);

5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).
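
The six groups above lend themselves to a simple lookup. The following Python sketch, offered only as an illustration, classifies each difference between a reference sequence and an equal-length variant as conservative or not, and reports the fraction of positions changed so that it can be compared with the "less than about 5%" and "less than about 1%" figures mentioned in the text. The function names and the restriction to substitutions (no insertions or deletions) are assumptions made for this example.

```python
# The six conservative-substitution groups listed in the text, one-letter codes.
CONSERVATIVE_GROUPS = [
    set("AST"),   # 1) Alanine, Serine, Threonine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
]
GROUP_OF = {aa: i for i, group in enumerate(CONSERVATIVE_GROUPS) for aa in group}


def is_conservative(original: str, substitute: str) -> bool:
    """True if both residues differ and belong to the same listed group."""
    if original == substitute:
        return False  # identical residues are not a substitution
    group = GROUP_OF.get(original)
    return group is not None and group == GROUP_OF.get(substitute)


def summarize_substitutions(reference: str, variant: str) -> dict:
    """Compare two equal-length sequences position by position (1-based)."""
    if not reference or len(reference) != len(variant):
        raise ValueError("sequences must be non-empty and of equal length")
    changes = [
        (i + 1, ref, var)
        for i, (ref, var) in enumerate(zip(reference, variant))
        if ref != var
    ]
    return {
        "changes": changes,
        "changed_fraction": len(changes) / len(reference),
        "all_conservative": all(is_conservative(r, v) for _, r, v in changes),
    }


if __name__ == "__main__":
    # Arbitrary example sequences, not taken from the patent.
    print(summarize_substitutions("MKTAYIAKQR", "MRTAYLAKQR"))
```

Residues that appear in none of the six groups (for example glycine or proline) are never reported as conservative by this sketch, which mirrors the list as written rather than any broader substitution table.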

[0122] A conservative substitution is a substitution in which the substituting amino acid (naturally occurring or modified)
is structurally related to the amino acid being substituted, i.e., has about the same size and electronic properties as the
amino acid being substituted. Thus, the substituting amino acid would have the same or a similar functional group in the
side chain as the original amino acid. A "conservative substitution" also refers to utilizing a substituting amino acid which
is identical to the amino acid being substituted except that a functional group in the side chain is protected with a suitable
protecting group.

[0123] Suitable protecting groups are described in Green and Wuts, "Protecting Groups in Organic Synthesis", John
Wiley and Sons, Chapters 5 and 7, 1991, the teachings of which are incorporated herein by reference. Preferred protecting
groups are those which facilitate transport of the peptide through membranes, for example, by reducing the hydrophilicity
and increasing the lipophilicity of the peptide, and which can be cleaved, either by hydrolysis or enzymatically (Ditter et
al., 1968. J. Pharm. Sci. 57:783; Ditter et al., 1968. J. Pharm. Sci. 57:828; Ditter et al., 1969. J. Pharm. Sci. 58:557; King
et al., 1987. Biochemistry 26:2294; Lindberg et al., 1989. Drug Metabolism and Disposition 17:311; Tunek et al., 1988.
Biochem. Pharm. 37:3387; Anderson et al., 1985. Arch. Biochem. Biophys. 239:538; and Singhal et al., 1987. FASEB
J. 1:220). Suitable hydroxyl protecting groups include ester, carbonate and carbamate protecting groups. Suitable amine
protecting groups include acyl groups and alkoxy or aryloxy carbonyl groups, as described above for N-terminal protecting
groups. Suitable carboxylic acid protecting groups include aliphatic, benzyl and aryl esters, as described below for
C-terminal protecting groups. In one embodiment, the carboxylic acid group in the side chain of one or more glutamic acid
or aspartic acid residues in a peptide of the present invention is protected, preferably as a methyl, ethyl, benzyl or
substituted benzyl ester, more preferably as a benzyl ester.

[0124] Provided below are groups of naturally occurring and modified amino acids in which each amino acid in a group has similar electronic and steric properties. Thus, a conservative substitution can be made by substituting an amino acid with another amino acid from the same group. It is to be understood that these groups are non-limiting, i.e., that there are additional modified amino acids which could be included in each group.

Group I includes leucine, isoleucine, valine, methionine and modified amino acids having the following side chains: ethyl, n-propyl, n-butyl. Preferably, Group I includes leucine, isoleucine, valine and methionine.
Group II includes glycine, alanine, valine and a modified amino acid having an ethyl side chain. Preferably, Group II includes glycine and alanine.

Group III includes phenylalanine, phenylglycine, tyrosine, tryptophan, cyclohexylmethyl glycine, and modified amino residues having substituted benzyl or phenyl side chains. Preferred substituents include one or more of the following: halogen, methyl, ethyl, nitro, -NH2, methoxy, ethoxy and -CN. Preferably, Group III includes phenylalanine, tyrosine and tryptophan.

Group IV includes glutamic acid, aspartic acid, a substituted or unsubstituted aliphatic, aromatic or benzylic ester of glutamic or aspartic acid (e.g., methyl, ethyl, n-propyl, iso-propyl, cyclohexyl, benzyl or substituted benzyl), glutamine, asparagine, -CO-NH- alkylated glutamine or asparagines (e.g., methyl, ethyl, n-propyl and iso-propyl) and modified amino acids having the side chain -(CH2)3-COOH, an ester thereof (substituted or unsubstituted aliphatic, aromatic or benzylic ester), an amide thereof and a substituted or unsubstituted N-alkylated amide thereof. Preferably, Group IV includes glutamic acid, aspartic acid, methyl aspartate, ethyl aspartate, benzyl aspartate and methyl glutamate, ethyl glutamate and benzyl glutamate, glutamine and asparagine.

Group V includes histidine, lysine, ornithine, arginine, N-nitroarginine, β-cycloarginine, γ-hydroxyarginine, N-amidinocitruline and 2-amino-4-guanidinobutanoic acid, homologs of lysine, homologs of arginine and homologs of ornithine. Preferably, Group V includes histidine, lysine, arginine and ornithine. A homolog of an amino acid includes from 1 to about 3 additional or subtracted methylene units in the side chain.

Group VI includes serine, threonine, cysteine and modified amino acids having C1-C5 straight or branched alkyl side chains substituted with -OH or -SH, for example, -CH2CH2OH, -CH2CH2CH2OH or -CH2CH2OHCH3. Preferably, Group VI includes serine, cysteine or threonine.
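
By way of illustration only, the preferred membership of Groups I to VI can be held in a lookup table and used to test whether a proposed substitution is conservative in the sense defined above. The sketch below is not part of the described invention; the names PREFERRED_GROUPS, groups_of and is_conservative are assumptions introduced for the example, and only the "Preferably, Group ... includes" members are listed.

```python
# Preferred members of Groups I-VI as stated in the text.
# The table layout and the function names are illustrative assumptions.
PREFERRED_GROUPS = {
    "I": {"leucine", "isoleucine", "valine", "methionine"},
    "II": {"glycine", "alanine"},
    "III": {"phenylalanine", "tyrosine", "tryptophan"},
    "IV": {
        "glutamic acid", "aspartic acid",
        "methyl aspartate", "ethyl aspartate", "benzyl aspartate",
        "methyl glutamate", "ethyl glutamate", "benzyl glutamate",
        "glutamine", "asparagine",
    },
    "V": {"histidine", "lysine", "arginine", "ornithine"},
    "VI": {"serine", "cysteine", "threonine"},
}


def groups_of(residue):
    """Return the labels of every preferred group that contains the residue."""
    name = residue.strip().lower()
    return [label for label, members in PREFERRED_GROUPS.items() if name in members]


def is_conservative(original, substitute):
    """True when both residues share at least one preferred group."""
    return bool(set(groups_of(original)) & set(groups_of(substitute)))


if __name__ == "__main__":
    print(is_conservative("leucine", "valine"))      # both in Group I
    print(is_conservative("glycine", "tryptophan"))  # Groups II and III
```

A residue that appears in none of the preferred groups (proline, for example) returns an empty list, so any substitution involving it is reported as non-conservative by this sketch.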

[0125] In another aspect, suitable substitutions for amino acid residues include "severe" substitutions. A "severe substitution" is a substitution in which the substituting amino acid (naturally occurring or modified) has significantly different size and/or electronic properties compared with the amino acid being substituted. Thus, the side chain of the substituting amino acid can be significantly larger (or smaller) than the side chain of the amino acid being substituted and/or can have functional groups with significantly different electronic properties than the amino acid being substituted. Examples of severe substitutions of this type include the substitution of phenylalanine or cyclohexylmethyl glycine for alanine, isoleucine for glycine, a D amino acid for the corresponding L amino acid, or -NH-CH[(-CH2)5-COOH]-CO- for aspartic acid. Alternatively, a functional group may be added to the side chain, deleted from the side chain or exchanged with another functional group. Examples of severe substitutions of this type include adding a functional group to the side chain of valine, leucine or isoleucine, exchanging the carboxylic acid in the side chain of aspartic acid or glutamic acid with an amine, or deleting the amine group in the side chain of lysine or ornithine. In yet another alternative, the side chain of the substituting amino acid can have significantly different steric and electronic properties than the functional group of the amino acid being substituted. Examples of such modifications include tryptophan for glycine, lysine for aspartic acid and -(CH2)4COOH for the side chain of serine. These examples are not meant to be limiting.

[0126] In another embodiment, for example in the synthesis of a peptide 26 amino acids in length, the individual amino acids may be substituted in the following manner:

AA1 is serine, glycine, alanine, cysteine or threonine;
AA2 is alanine, threonine, glycine, cysteine or serine;
AA3 is valine, arginine, leucine, isoleucine, methionine, ornithine, lysine, N-nitroarginine, β-cycloarginine, γ-hydroxyarginine, N-amidinocitruline or 2-amino-4-guanidinobutanoic acid;
AA4 is proline, leucine, valine, isoleucine or methionine;
AA5 is tryptophan, alanine, phenylalanine, tyrosine or glycine;
AA6 is serine, glycine, alanine, cysteine or threonine;
AA7 is proline, leucine, valine, isoleucine or methionine;
AA8 is alanine, threonine, glycine, cysteine or serine;
AA9 is alanine, threonine, glycine, cysteine or serine;
AA10 is leucine, isoleucine, methionine or valine;
AA11 is serine, glycine, alanine, cysteine or threonine;
AA12 is leucine, isoleucine, methionine or valine;
AA13 is leucine, isoleucine, methionine or valine;
AA14 is glutamine, glutamic acid, aspartic acid, asparagine, or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
AA15 is arginine, N-nitroarginine, β-cycloarginine, γ-hydroxyarginine, N-amidinocitruline or 2-amino-4-guanidinobutanoic acid;

AA16 is proline, leucine, valine, isoleucine or methionine;
AA17 is serine, glycine, alanine, cysteine or threonine;
AA18 is glutamic acid, aspartic acid, asparagine, glutamine or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
AA19 is aspartic acid, asparagine, glutamic acid, glutamine, leucine, valine, isoleucine, methionine or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
AA20 is valine, arginine, leucine, isoleucine, methionine, ornithine, lysine, N-nitroarginine, β-cycloarginine, γ-hydroxyarginine, N-amidinocitruline or 2-amino-4-guanidinobutanoic acid;
AA21 is alanine, threonine, glycine, cysteine or serine;
AA22 is alanine, threonine, glycine, cysteine or serine;
AA23 is histidine, serine, threonine, cysteine, lysine or ornithine;
AA24 is threonine, aspartic acid, serine, glutamic acid or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
AA25 is asparagine, aspartic acid, glutamic acid, glutamine, leucine, valine, isoleucine, methionine or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid; and
AA26 is cysteine, histidine, serine, threonine, lysine or ornithine.

[0127] It is to be understood that these amino acid substitutions may be made for longer or shorter peptides than the 26-mer in the preceding example, and for proteins.

[0128] In one embodiment of the present invention, codons for the first several N-terminal amino acids of the transposase are modified such that the third base of each codon is changed to an A or a T without changing the corresponding amino acid. It is preferable that between approximately 1 and 20, more preferably 3 and 15, and most preferably between 4 and 12 of the first N-terminal codons of the gene of interest are modified such that the third base of each codon is changed to an A or a T without changing the corresponding amino acid. In one embodiment, the first ten N-terminal codons of the gene of interest are modified in this manner.
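
The codon modification described above can be sketched in a few lines of code. The example below is a minimal illustration, assuming the standard genetic code and assuming that a codon with no synonymous A- or T-ending alternative (for example ATG or TGG) is simply left unchanged; the function names translate and at_bias_n_terminus, and the short test sequence, are hypothetical and are not taken from the specification.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate complete codons of a DNA string into one-letter amino acids."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def at_bias_n_terminus(dna, n_codons=10):
    """Change the third base of the first n_codons codons to A or T
    wherever a synonymous A- or T-ending codon exists (assumed policy:
    try A first, then T; otherwise keep the codon as it is)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    out = []
    for start in range(0, usable, 3):
        codon = dna[start:start + 3]
        if start // 3 < n_codons and codon[2] in "GC":
            for base in "AT":
                candidate = codon[:2] + base
                if CODON_TABLE[candidate] == CODON_TABLE[codon]:
                    codon = candidate
                    break
        out.append(codon)
    # Any trailing partial codon is carried over untouched.
    return "".join(out) + dna[usable:]


if __name__ == "__main__":
    original = "ATGGCCCTGTGGAAGGAC"  # hypothetical sequence, not from the text
    modified = at_bias_n_terminus(original, n_codons=10)
    print(modified)
    print(translate(original) == translate(modified))
```

Because every replacement is checked against the codon table before it is accepted, the encoded amino acid sequence is the same before and after modification.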

[0129] When several desired proteins, protein fragments or peptides are encoded in the gene of interest to be incorporated into the genome, one of skill in the art will appreciate that the proteins, protein fragments or peptides may be separated by a spacer molecule such as, for example, a peptide, consisting of one or more amino acids. Generally, the spacer will have no specific biological activity other than to join the desired proteins, protein fragments or peptides together, or to preserve some minimum distance or other spatial relationship between them. However, the constituent amino acids of the spacer may be selected to influence some property of the molecule such as the folding, net charge, or hydrophobicity. The spacer may also be contained within a nucleotide sequence with a purification handle or be flanked by proteolytic cleavage sites.

[0130] Such polypeptide spacers may have from about 5 to about 40 amino acid residues. The spacers in a polypeptide are independently chosen, but are preferably all the same. The spacers should allow for flexibility of movement in space and are therefore typically rich in small amino acids, for example, glycine, serine, proline or alanine. Preferably, peptide spacers contain at least 60%, more preferably at least 80% glycine or alanine. In addition, peptide spacers generally have little or no biological and antigenic activity. Preferred spacers are (Gly-Pro-Gly-Gly)x (SEQ ID NO:5) and (Gly4-Ser)y, wherein x is an integer from about 3 to about 9 and y is an integer from about 1 to about 8. Specific examples of suitable spacers include

(Gly-Pro-Gly-Gly)3
SEQ ID NO:6 Gly Pro Gly Gly Gly Pro Gly Gly Gly Pro Gly Gly

(Gly4-Ser)3
SEQ ID NO:7 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser

or (Gly4-Ser)4
SEQ ID NO:8 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser.
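
As an illustrative sketch only, the repeat spacers above can be generated from their repeat units and checked against the stated length range (about 5 to about 40 residues) and the stated glycine/alanine content (at least 60%, more preferably at least 80%). The helper names repeat_spacer, small_residue_fraction and meets_preferences are assumptions made for this example.

```python
# Repeat units of the preferred spacers, written as three-letter codes.
GPGG = ["Gly", "Pro", "Gly", "Gly"]   # unit of (Gly-Pro-Gly-Gly)x
G4S = ["Gly"] * 4 + ["Ser"]           # unit of (Gly4-Ser)y


def repeat_spacer(unit, repeats):
    """Build a spacer by concatenating `repeats` copies of a repeat unit."""
    return list(unit) * repeats


def small_residue_fraction(spacer, residues=("Gly", "Ala")):
    """Fraction of residues in the spacer that are glycine or alanine."""
    return sum(residue in residues for residue in spacer) / len(spacer)


def meets_preferences(spacer, min_fraction=0.6, min_len=5, max_len=40):
    """Check the length range and the Gly/Ala content stated in the text."""
    in_range = min_len <= len(spacer) <= max_len
    return in_range and small_residue_fraction(spacer) >= min_fraction


if __name__ == "__main__":
    for name, unit, n in (("SEQ ID NO:6", GPGG, 3),
                          ("SEQ ID NO:7", G4S, 3),
                          ("SEQ ID NO:8", G4S, 4)):
        spacer = repeat_spacer(unit, n)
        print(name, " ".join(spacer), len(spacer),
              round(small_residue_fraction(spacer), 2))
```

Under this check, (Gly-Pro-Gly-Gly)3 is 75% glycine and so satisfies the 60% preference but not the 80% preference, whereas the (Gly4-Ser) repeats are 80% glycine.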

[0131] Nucleotide sequences encoding for the production of residues which may be useful in purification of the expressed recombinant protein may also be built into the vector. Such sequences are known in the art and include the glutathione binding domain from glutathione S-transferase, polylysine, hexa-histidine or other cationic amino acids, thioredoxin, hemagglutinin antigen and maltose binding protein.

[0132] Additionally, nucleotide sequences may be inserted into the gene of interest to be incorporated so that the protein or peptide can also include from one to about six amino acids that create signals for proteolytic cleavage. In this manner, if a gene is designed to make one or more peptides or proteins of interest in the transgenic animal, specific nucleotide sequences encoding for amino acids recognized by enzymes may be incorporated into the gene to facilitate cleavage of the large protein or peptide sequence into desired peptides or proteins or both. For example, nucleotides encoding a proteolytic cleavage site can be introduced into the gene of interest so that a signal sequence can be cleaved from a protein or peptide encoded by the gene of interest. Nucleotide sequences encoding other amino acid sequences which display pH sensitivity or chemical sensitivity may also be added to the vector to facilitate separation of the signal sequence from the peptide or protein of interest.

[0133] In one embodiment of the present invention, a TAG sequence is linked to the gene of interest. The TAG sequence serves three purposes: 1) it allows free rotation of the peptide or protein to be isolated so there is no interference from the native protein or signal sequence, i.e., vitellogenin, 2) it provides a "purification handle" to isolate the protein using column purification, and 3) it includes a cleavage site to remove the desired protein from the signal and purification sequences. Accordingly, as used herein, a TAG sequence includes a spacer sequence, a purification handle and a cleavage site. The spacer sequences in the TAG proteins contain one or more repeats shown in SEQ ID NO:25. A preferred spacer sequence comprises the sequence provided in SEQ ID NO:26. One example of a purification handle is the gp41 hairpin loop from HIV I. Exemplary gp41 polynucleotide and polypeptide sequences are provided in SEQ ID NO:24 and SEQ ID NO:23, respectively. However, it should be understood that any antigenic region may be used as a purification handle, including any antigenic region of gp41. Preferred purification handles are those that elicit highly specific antibodies. Additionally, the cleavage site can be any protein cleavage site known to one of ordinary skill in the art and includes an enterokinase cleavage site comprising the Asp Asp Asp Asp Lys sequence (SEQ ID NO:9) and a furin cleavage site. Constructs containing a TAG sequence are shown in Figures 2 and 3. In one embodiment of the present invention, the TAG sequence comprises a polynucleotide sequence of SEQ ID NO:22.
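
The role of the cleavage site in the TAG sequence can be illustrated with a small string-based sketch. Enterokinase cleaves on the C-terminal side of the lysine in Asp Asp Asp Asp Lys, so a fusion protein of the form carrier, TAG, desired protein yields the desired protein as the downstream fragment. In the example below every sequence other than the enterokinase site is a placeholder invented for illustration ("X" denotes an unspecified residue), and the function name cleave_after_site is an assumption.

```python
ENTEROKINASE_SITE = "DDDDK"  # Asp Asp Asp Asp Lys (SEQ ID NO:9), one-letter code


def cleave_after_site(fusion, site=ENTEROKINASE_SITE):
    """Split a one-letter protein string immediately after the first
    occurrence of the cleavage site. Returns (upstream, downstream);
    downstream is empty when the site is absent."""
    position = fusion.find(site)
    if position == -1:
        return fusion, ""
    cut = position + len(site)
    return fusion[:cut], fusion[cut:]


if __name__ == "__main__":
    carrier = "X" * 8             # placeholder for the carrier or signal portion
    spacer = "X" * 10             # placeholder for the spacer repeats
    handle = "X" * 6              # placeholder for the purification handle
    desired = "ACDEFGHIK"         # hypothetical desired peptide
    tag = spacer + handle + ENTEROKINASE_SITE
    upstream, downstream = cleave_after_site(carrier + tag + desired)
    print(upstream)
    print(downstream)
```

Placing the cleavage site at the very end of the TAG means that the downstream fragment carries no residues from the spacer or the purification handle.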

[0134] In addition to the transposon-based vectors described above, the present invention also includes the use of a vector according to the invention for producing a transgenic animal and makes reference to methods of administering the transposon-based vectors to an animal, methods of producing a transgenic animal wherein a gene of interest is incorporated into the germline of the animal and methods of producing a transgenic animal wherein a gene of interest is incorporated into cells other than the germline cells of the animal. The transposon-based vectors of the present invention may be administered to an animal via any method known to those of skill in the art, including, but not limited to, intraembryonic, intratesticular, intraoviduct, intraperitoneal, intraarterial, intravenous, topical, oral, nasal, and pronuclear injection methods of administration, or any combination thereof. The transposon-based vectors may also be administered within the lumen of an organ, into an organ, into a body cavity, into the cerebrospinal fluid, through the urinary system or through any route to reach the desired cells.

[0135] The transposon-based vectors may be delivered through the vascular system to be distributed to the cells supplied by that vessel. For example, the compositions may be placed in the artery supplying the ovary or supplying the fallopian tube to transfect cells in those tissues. In this manner, follicles could be transfected to create a germline transgenic animal. Alternatively, supplying the compositions through the artery leading to the oviduct would preferably transfect the tubular gland and epithelial cells. Such transfected cells could manufacture a desired protein or peptide for deposition in the egg white. Administration of the compositions through the portal vein would target uptake and transformation of hepatic cells. Administration through the urethra and into the bladder would target the transitional epithelium of the bladder. Administration through the vagina and cervix would target the lining of the uterus. Administration through the internal mammary artery would transfect secretory cells of the lactating mammary gland to perform a desired function, such as to synthesize and secrete a desired protein or peptide into the milk.

[0136] In a preferred embodiment, the animal is an egg-laying animal, and more preferably, an avian. In one embodiment, between approximately 1 and 50 μg, preferably between 1 and 20 μg, and more preferably between 5 and 10 μg of transposon-based vector DNA is administered to the oviduct of a bird. Optimal ranges depend upon the type of bird and the bird's stage of sexual maturity. Intraoviduct administration of the transposon-based vectors of the present invention results in a PCR positive signal in the oviduct tissue, whereas intravascular administration results in a PCR positive signal in the liver. In other embodiments, the transposon-based vector is administered to an artery that supplies the oviduct or the liver. These methods of administration may also be combined with any methods for facilitating transfection, including without limitation, electroporation, gene guns, injection of naked DNA, and use of dimethyl sulfoxide (DMSO).

[0137] The present invention includes the use of a vector according to the invention for intraembryonic administration of a vector according to the invention to an avian embryo comprising the following steps: 1) incubating an egg on its side at room temperature for two hours to allow the embryo contained therein to move to top dead center (TDC); 2) drilling a hole through the shell without penetrating the underlying shell membrane; 3) injecting the embryo with the transposon-based vector in solution; 4) sealing the hole in the egg; and 5) placing the egg in an incubator for hatching. Administration of the transposon-based vector can occur anytime between immediately after egg lay (when the embryo is at Stage X) and hatching. Preferably, the transposon-based vector is administered between 1 and 7 days after egg lay, more preferably between 1 and 2 days after egg lay. The transposon-based vectors may be introduced into the embryo in amounts ranging from about 5.0 μg to 10 pg, preferably 1.0 μg to 100 pg. Additionally, the transposon-based vector solution volume may be between approximately 1 μl to 75 μl in quail and between approximately 1 μl to 500 μl in chicken.

[0138] The present invention also includes the use of a vector according to the invention for intratesticular administration of a transposon-based vector including injecting a bird with a composition comprising the transposon-based vector, an appropriate carrier and an appropriate transfection reagent. In one embodiment, the bird is injected before sexual maturity, preferably between approximately 4-14 weeks, more preferably between approximately 6-14 weeks and most preferably between 8-12 weeks old. In another embodiment, a mature bird is injected with a transposon-based vector, an appropriate carrier and an appropriate transfection reagent. The mature bird may be any type of bird, but in one example the mature bird is a quail.

[0139] A bird is preferably injected prior to the development of the blood-testis barrier, which thereby facilitates entry of the transposon-based vector into the seminiferous tubules and transfection of the spermatogonia or other germline cells. At and between the ages of 4, 6, 8, 10, 12, and 14 weeks, it is believed that the testes of chickens are likely to be most receptive to transfection. In this age range, the blood/testis barrier has not yet formed, and there is a relatively high number of spermatogonia relative to the numbers of other cell types, e.g., spermatids, etc. See J. Kumaran et al., 1949. Poultry Sci., 29:511-520. See also E. Oakberg, 1958. Am. J. Anatomy, 99:507-515; and P. Kluin et al., 1984. Anat. Embryol., 169:73-78.

[0140] The transposon-based vectors may be introduced into a testis in an amount ranging from about 0.1 to 10 μg, preferably 1 μg to 10 μg, more preferably 3 to 10 μg. In a quail, about 5 μg is a preferred amount. In a chicken, about 5 μg to 10 μg per testis is preferred. These amounts of vector DNA may be injected in one dose or multiple doses and at one site or multiple sites in the testis. In a preferred embodiment, the vector DNA is administered at multiple sites in a single testis, both testes being injected in this manner. In one embodiment, injection is spread over three injection sites: one at each end of the testis, and one in the middle. Additionally, the transposon-based vector solution volume may be between approximately 1 μl to 75 μl in quail and between approximately 1 μl to 500 μl in chicken. In a preferred embodiment, the transposon-based vector solution volume may be between approximately 20 μl to 60 μl in quail and between approximately 50 μl to 250 μl in chicken. Both the amount of vector DNA and the total volume injected into each testis may be determined based upon the age and size of the bird.
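
For illustration, the intratesticular dose and volume ranges recited above can be tabulated and a proposed value classified against them. This is a sketch under stated assumptions: the table names DNA_UG and VOLUME_UL, the labels "allowed", "preferred" and "more_preferred", and the function classify are introduced for the example only, and the endpoints are treated as inclusive although the text qualifies them with "about" and "approximately".

```python
# Ranges recited for intratesticular administration.
# DNA amount per testis, in micrograms.
DNA_UG = {
    "allowed": (0.1, 10),
    "preferred": (1, 10),
    "more_preferred": (3, 10),
}

# Vector solution volume, in microliters, by species.
VOLUME_UL = {
    "quail": {"allowed": (1, 75), "preferred": (20, 60)},
    "chicken": {"allowed": (1, 500), "preferred": (50, 250)},
}


def classify(value, ranges):
    """Return the label of the narrowest range containing value, or None."""
    by_width = sorted(ranges.items(), key=lambda item: item[1][1] - item[1][0])
    for label, (low, high) in by_width:
        if low <= value <= high:
            return label
    return None


if __name__ == "__main__":
    print(classify(5, DNA_UG))                  # amount stated as preferred for quail
    print(classify(100, VOLUME_UL["chicken"]))
    print(classify(100, VOLUME_UL["quail"]))    # outside the quail range
```

Checking the narrowest range first means that a value falling in several nested ranges is reported with the most specific label.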

[0141] According to the present invention, the transposon-based vector is to be administered in conjunction with an acceptable carrier and/or transfection reagent. Acceptable carriers include, but are not limited to, water, saline, Hanks Balanced Salt Solution (HBSS), Tris-EDTA (TE) and lyotropic liquid crystals. Transfection reagents commonly known to one of ordinary skill in the art that may be employed include, but are not limited to, the following: cationic lipid transfection reagents, cationic lipid mixtures, polyamine reagents, liposomes and combinations thereof; SUPERFECT®, Cytofectene, BioPORTER®, GenePORTER®, NeuroPORTER®, and perfectin from Gene Therapy Systems; lipofectamine, cellfectin, DMRIE-C, oligofectamine, and PLUS reagent from InVitrogen; Xtreme gene, fugene, DOSPER and DOTAP from Roche; Lipotaxi and Genejammer from Strategene; and Escort from SIGMA. In one embodiment, the transfection reagent is SUPERFECT®. The ratio of DNA to transfection reagent may vary based upon the method of administration. In one embodiment, the transposon-based vector is administered intratesticularly and the ratio of DNA to transfection reagent can be from 1:1.5 to 1:15, preferably 1:2 to 1:10, all expressed as wt/vol. Transfection may also be accomplished using other means known to one of ordinary skill in the art, including without limitation electroporation, gene guns, injection of naked DNA, and use of dimethyl sulfoxide (DMSO).
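
As a worked illustration of the wt/vol ratios above, the following sketch converts a DNA amount into the corresponding range of transfection reagent volumes. The text does not state the units of the wt/vol ratio; the example assumes micrograms of DNA to microliters of reagent, and the function name reagent_volume_range is hypothetical.

```python
def reagent_volume_range(dna_ug, low_ratio=2.0, high_ratio=10.0):
    """Reagent volume range for a DNA mass, given DNA:reagent ratios of
    1:low_ratio to 1:high_ratio (wt/vol).

    Assumption: the ratio is read as micrograms of DNA to microliters of
    reagent. The defaults are the preferred 1:2 to 1:10 range."""
    return dna_ug * low_ratio, dna_ug * high_ratio


if __name__ == "__main__":
    # 5 micrograms of vector DNA at the preferred and at the broader ratio range.
    print(reagent_volume_range(5))
    print(reagent_volume_range(5, low_ratio=1.5, high_ratio=15.0))
```

Under the stated unit assumption, 5 μg of vector DNA corresponds to 10 μl to 50 μl of reagent at the preferred ratios and 7.5 μl to 75 μl at the broader ratios.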

[0142] Depending upon the cell or tissue type targeted for transfection, the form of the transposon-based vector may be important. Plasmids harvested from bacteria are generally closed circular supercoiled molecules, and this is the preferred state of a vector for gene delivery because of the ease of preparation. In some instances, transposase expression and insertion may be more efficient in a relaxed, closed circular configuration or in a linear configuration. In still other instances, a purified transposase protein may be co-injected with a transposon-based vector containing the gene of interest for more immediate insertion. This could be accomplished by using a transfection reagent complexed with both the purified transposase protein and the transposon-based vector.

[0143] Following administration of a transposon-based vector to an animal, DNA is extracted from the animal to confirm integration of the gene of interest. Actual frequencies of integration are estimated both by comparative strength of the PCR signal, and by histological evaluation of the tissues by quantitative PCR. Another method for estimating the rate of transgene insertion is the so-called primed in situ hybridization technique (PRINS). This method determines not only which cells carry a transgene of interest, but also into which chromosome the gene has inserted, and even what portion of the chromosome. Briefly, labeled primers are annealed to chromosome spreads (affixed to glass slides) through one round of PCR, and the slides are then developed through normal in situ hybridization procedures. This technique combines the best features of in situ PCR and fluorescence in situ hybridization (FISH) to provide distinct chromosome location and copy number of the gene in question. The 28s rRNA gene will be used as a positive control for spermatogonia to confirm that the technique is functioning properly. Using different fluorescent labels for the transgene and the 28s gene causes cells containing a transgene to fluoresce with two different colored tags.

[0144] Breeding experiments are also conducted to determine if germline transmission of the transgene has occurred. In a general bird breeding experiment performed according to the present invention, each male bird was exposed to 2-3 different adult female birds for 3-4 days each. This procedure was continued with different females for a total period of 6-12 weeks. Eggs were collected daily for up to 14 days after the last exposure to the transgenic male, and each egg was incubated in a standard incubator. In the first series of experiments, the resulting embryos were examined for transgene presence at day 3 or 4 using PCR.

[0145] Any male producing a transgenic embryo was bred to additional females. Eggs from these females were incubated, hatched, and the chicks tested for the exogenous DNA. Any embryos that died were necropsied and examined directly for the transgene or protein encoded by the transgene, either by fluorescence or PCR. The offspring that hatched and were found to be positive for the exogenous DNA were raised to maturity. These birds were bred to produce further generations of transgenic birds, to verify efficiency of the transgenic procedure and the stable incorporation of the transgene into the germ line. The resulting embryos were examined for transgene presence at day 3 or 4 using PCR.

[0146] It is to be understood that the above procedure can be modified to suit animals other than birds and that selective breeding techniques may be performed to amplify gene copy numbers and protein output.

Production of Desired Proteins or Peptides in Egg White



[0147] In one embodiment, the transposon-based vectors of the present invention may be administered to a bird for production of desired proteins or peptides in the egg white. These transposon-based vectors preferably contain one or more of an ovalbumin promoter, an ovomucoid promoter, an ovalbumin signal sequence and an ovomucoid signal sequence. Oviduct-specific ovalbumin promoters are described in B. O'Malley et al., 1987. EMBO J., vol. 5, pp. 2395-12; A. Qiu et al., 1994. Proc. Nat. Acad. Sci. (USA), vol. 91, pp. 4451-4455; D. Monroe et al., 2000. Biochim. Biophys. Acta, 1517(1):27-32; H. Park et al., 2000. Biochem., 39:8537-8545; and T. Muramatsu et al., 1998. Poult. Avian Biol. Rev., 6:107-123. Examples of transposon-based vectors designed for production of a desired protein in an egg white are shown in Figures 2 and 3.



[0148] The present invention is particularly advantageous for production of recombinant peptides and proteins of low solubility in the egg yolk. Such proteins include, but are not limited to, membrane-associated or membrane-bound proteins, lipophilic compounds, attachment factors, receptors, and components of second messenger transduction machinery. Low solubility peptides and proteins are particularly challenging to produce using conventional recombinant protein production techniques (cell and tissue cultures) because they aggregate in water-based, hydrophilic environments. Such aggregation necessitates denaturation and re-folding of the recombinantly-produced proteins, which may deleteriously affect their structure and function. Moreover, even highly soluble recombinant peptides and proteins may precipitate and require denaturation and renaturation when produced in sufficiently high amounts in recombinant protein production systems. The present invention provides an advantageous resolution of the problem of protein and peptide solubility during production of large amounts of recombinant proteins.

[0149] In one embodiment of the present invention, deposition of a desired protein into the egg yolk is accomplished by attaching a sequence encoding a protein capable of binding to the yolk vitellogenin receptor to a gene of interest that encodes a desired protein. This transposon-based vector can be used for the receptor-mediated uptake of the desired protein by the oocytes. In a preferred embodiment, the sequence ensuring the binding to the vitellogenin receptor is a targeting sequence of a vitellogenin protein. The invention encompasses various vitellogenin proteins and their targeting sequences. In a preferred embodiment, a chicken vitellogenin protein targeting sequence is used; however, due to the high degree of conservation among vitellogenin protein sequences and known cross-species reactivity of vitellogenin targeting sequences with their egg-yolk receptors, other vitellogenin targeting sequences can be substituted. One example of a construct for use in the transposon-based vectors of the present invention and for deposition of an insulin protein in an egg yolk is provided in SEQ ID NO:27. In this embodiment, the transposon-based vector contains a vitellogenin promoter, a vitellogenin targeting sequence, a TAG sequence, a pro-insulin sequence and a synthetic polyA sequence. The present invention includes, but is not limited to, vitellogenin targeting sequences residing in the N-terminal domain of vitellogenin, particularly in lipovitellin I. In one embodiment, the vitellogenin targeting sequence contains the polynucleotide sequence of SEQ ID NO:18.

[0150] In a preferred embodiment, the transposon-based vector contains a transposase gene operably-linked to a liver-specific promoter and a gene of interest operably-linked to a promoter selected from the group consisting of an ovalbumin promoter, a conalbumin promoter, a vitellogenin promoter or an ovomucoid promoter and a vitellogenin targeting sequence. Figure 4 shows an example of such a construct. In another preferred embodiment, the transposon-based vector contains a transposase gene operably-linked to a constitutive promoter and a gene of interest operably-linked to a promoter selected from the group consisting of an ovalbumin promoter, a conalbumin promoter, a vitellogenin promoter or an ovomucoid promoter and a vitellogenin targeting sequence.



Isolation and Purification of Desired Protein or Peptide 



[0151] For large-scale production of protein, an animal breeding stock that is homozygous for the transgene is preferred. Such homozygous individuals are obtained and identified through, for example, standard animal breeding procedures or PCR protocols.

[0152] Once expressed, peptides, polypeptides and proteins can be purified according to standard procedures known to one of ordinary skill in the art, including ammonium sulfate precipitation, affinity columns, column chromatography, gel electrophoresis, high performance liquid chromatography, immunoprecipitation and the like. Substantially pure compositions of about 50 to 99% homogeneity are preferred, and 80 to 95% or greater homogeneity are most preferred for use as therapeutic agents.

[0153] In one embodiment of the present invention, the animal in which the desired protein is produced is an egg-laying animal. In a preferred embodiment of the present invention, the animal is an avian and a desired peptide, polypeptide or protein is isolated from an egg white. Egg white containing the exogenous protein or peptide is separated from the yolk and other egg constituents on an industrial scale by any of a variety of methods known in the egg industry. See, e.g., W. Stadelman et al. (Eds.), Egg Science & Technology, Haworth Press, Binghamton, NY (1995). Isolation of the exogenous peptide or protein from the other egg white constituents is accomplished by any of a number of polypeptide isolation and purification methods well known to one of ordinary skill in the art. These techniques include, for example, chromatographic methods such as gel permeation, ion exchange, affinity separation, metal chelation, HPLC, and the like, either alone or in combination. Another means that may be used for isolation or purification, either in lieu of or in addition to chromatographic separation methods, includes electrophoresis. Successful isolation and purification is confirmed by standard analytic techniques, including HPLC, mass spectroscopy, and spectrophotometry. These separation methods are often facilitated if the first step in the separation is the removal of the endogenous ovalbumin fraction of egg white, as doing so will reduce the total protein content to be further purified by about 50%.

[0154] To facilitate or enable purification of a desired protein or peptide, transposon-based vectors may include one
or more additional epitopes or domains. Such epitopes or domains include DNA sequences encoding enzymatic or
chemical cleavage sites including, but not limited to, an enterokinase cleavage site; the glutathione binding domain from
glutathione S-transferase; polylysine; hexa-histidine or other cationic amino acids; thioredoxin; hemagglutinin antigen;
maltose binding protein; a fragment of gp41 from HIV; and other purification epitopes or domains commonly known to
one of skill in the art.

[0155] In one representative embodiment, purification of desired proteins from egg white utilizes the antigenicity of
the ovalbumin carrier protein and particular attributes of a TAG linker sequence that spans ovalbumin and the desired
protein. The TAG sequence is particularly useful in this process because it contains 1) a highly antigenic epitope, a
fragment of gp41 from HIV, allowing for stringent affinity purification, and 2) a recognition site for the protease
enterokinase immediately juxtaposed to the desired protein. In a preferred embodiment the TAG sequence comprises
approximately 50 amino acids. A representative TAG sequence is provided below.



[0156] The underlined sequences were taken from the hairpin loop domain of HIV gp-41 (SEQ ID NO:23). Sequences
in italics represent the cleavage site for enterokinase (SEQ ID NO:9). The spacer sequence upstream of the loop domain
was made from repeats of (Pro Ala Asp Asp Ala) (SEQ ID NO:25) to provide free rotation and promote surface availability
of the hairpin loop from the ovalbumin carrier protein.

[0157] Isolation and purification of a desired protein is performed as follows:

1. Enrichment of the egg white protein fraction containing ovalbumin and the transgenic ovalbumin-TAG-desired
protein.

2. Size exclusion chromatography to isolate only those proteins within a narrow range of molecular weights (a further
enrichment of step 1).

3. Ovalbumin affinity chromatography. Highly specific antibodies to ovalbumin will eliminate virtually all extraneous
egg white proteins except ovalbumin and the transgenic ovalbumin-TAG-desired protein.

4. gp41 affinity chromatography using anti-gp41 antibodies. Stringent application of this step will result in virtually
pure transgenic ovalbumin-TAG-desired protein.

5. Cleavage of the transgene product can be accomplished in at least one of two ways:

a. The transgenic ovalbumin-TAG-desired protein is left attached to the gp41 affinity resin (beads) from step 4
and the protease enterokinase is added. This liberates the transgene target protein from the gp41 affinity resin
while the ovalbumin-TAG sequence is retained. Separation by centrifugation (in a batch process) or flow through
(in a column purification) leaves the desired protein together with enterokinase in solution. Enterokinase is
recovered and reused.

b. Alternatively, enterokinase is immobilized on resin (beads) by the addition of poly-lysine moieties to a
non-catalytic area of the protease. The transgenic ovalbumin-TAG-desired protein eluted from the affinity column
of step 4 is then applied to the protease resin. Protease action cleaves the ovalbumin-TAG sequence from the
desired protein and leaves both entities in solution. The immobilized enterokinase resin is recharged and reused.

c. The choice of these alternatives is made depending upon the size and chemical composition of the transgene
target protein.

6. A final separation of either of these two (5a or 5b) protein mixtures is made using size exclusion, or enterokinase
affinity chromatography. This step allows for desalting, buffer exchange and/or polishing, as needed.

[0158] Cleavage of the transgene product (ovalbumin-TAG-desired protein) by enterokinase, then, results in two
products: ovalbumin-TAG and the desired protein. More specific methods for isolation using the TAG label are provided
in the Examples. Some desired proteins may require additions or modifications of the above-described approach as
known to one of ordinary skill in the art. The method is scalable from the laboratory bench to pilot and production facility
largely because the techniques applied are well documented in each of these settings.
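
The two-product outcome of the enterokinase step can be illustrated with a minimal Python sketch. It assumes the canonical enterokinase recognition motif Asp-Asp-Asp-Asp-Lys (DDDDK) with cleavage after the lysine; this section does not reproduce SEQ ID NO:9, so equating it with DDDDK is an assumption. The carrier, linker and desired-protein strings are hypothetical placeholders, not sequences from this disclosure.

```python
# Canonical enterokinase recognition motif (ASSUMED to correspond to SEQ ID NO:9).
ENTEROKINASE_SITE = "DDDDK"


def cleave_fusion(fusion: str, site: str = ENTEROKINASE_SITE) -> tuple:
    """Split a fusion protein immediately after the first enterokinase site.

    Returns (carrier_with_tag, desired_protein). If no site is present,
    the uncut fusion is returned as a one-element tuple.
    """
    start = fusion.find(site)
    if start == -1:
        return (fusion,)
    cut = start + len(site)  # cleavage occurs after the Lys of DDDDK
    return fusion[:cut], fusion[cut:]


# Hypothetical placeholder sequences, for illustration only.
carrier = "MGSIGAASMEF"   # stands in for the ovalbumin carrier
linker = "GSGS"           # stands in for the spacer/epitope portion of TAG
desired = "FVNQHLCGSH"    # stands in for the desired protein

products = cleave_fusion(carrier + linker + ENTEROKINASE_SITE + desired)
print(products)
```

Because the recognition site sits immediately next to the desired protein, the second product in this sketch carries no residues from the TAG.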

[0159] It is believed that a typical chicken egg produced by a transgenic animal of the present invention will contain
at least 0.001 mg, from about 0.001 to 1.0 mg, or from about 0.001 to 100.0 mg of exogenous protein, peptide or
polypeptide, in addition to the normal constituents of egg white (or possibly replacing a small fraction of the latter).

[0160] One of skill in the art will recognize that after biological expression or purification, the desired proteins, fragments
thereof and peptides may possess a conformation substantially different than the native conformations of the proteins,
fragments thereof and peptides. In this case, it is often necessary to denature and reduce protein and then to cause the
protein to re-fold into the preferred conformation. Methods of reducing and denaturing proteins and inducing re-folding
are well known to those of skill in the art.

Production of Protein or Peptide in Milk



[0161] In addition to the use of a vector according to the invention for producing eggs containing transgenic proteins
or peptides, the present invention encompasses the use of a vector according to the invention for the production of milk
containing transgenic proteins or peptides. Each use includes the administration of a transposon-based vector described
above to a mammal. In one embodiment, the transposon-based vector contains a transposase operably-linked to a
constitutive promoter and a gene of interest operably-linked to a mammary-specific promoter. Genes of interest can
include, but are not limited to, antiviral and antibacterial proteins and immunoglobulins.

Treatment of Disease and Animal Improvement



[0162] In addition to production and isolation of desired molecules, the transposon-based vectors of the present
invention can be used for the treatment of various genetic disorders. For example, one or more transposon-based vectors
can be administered to a human or animal for the treatment of a single gene disorder including, but not limited to,
Huntington's disease, alpha-1-antitrypsin deficiency, Alzheimer's disease, various forms of breast cancer, cystic fibrosis,
galactosemia, congenital hypothyroidism, maple syrup urine disease, neurofibromatosis 1, phenylketonuria, sickle cell
disease, and Smith-Lemli-Opitz (SLO/RSH) Syndrome. Other diseases caused by single gene disorders that may be
treated with the present invention include autoimmune diseases, shipping fever in cattle, mastitis, bacterial or viral
diseases, and alteration of skin pigment in animals. In these embodiments, the transposon-based vector contains a
non-mutated, or non-disease causing form of the gene known to cause such disorder. Preferably, the transposase contained
within the transposase-based vector is operably linked to an inducible promoter such as a tissue-specific promoter such
that the non-mutated gene of interest is inserted into a specific tissue wherein the mutated gene is expressed in vivo.

[0163] In one embodiment of the present invention, a transposon-based vector comprising a gene encoding proinsulin
is administered to diabetic animals or humans for incorporation into liver cells in order to treat or cure diabetes. The
specific incorporation of the proinsulin gene into the liver is accomplished by placing the transposase gene under the
control of a liver-specific promoter, such as G6P. This approach is useful for treatment of both Type I and Type II diabetes.
The G6P promoter has been shown to be glucose responsive (Arguad, D., et al., 1996, Diabetes 45:1563-1571), and
thus, glucose-regulated insulin production is achieved using DNA constructs of the present invention. Integrating a
proinsulin gene into liver cells circumvents the problem of destruction of pancreatic islet cells in the course of Type I
diabetes.

[0164] In another embodiment, shortly after diagnosis of Type I diabetes, the cells of the immune system destroying
pancreatic β-cells are selectively removed using the transposon-based vectors of the present invention, thus allowing
normal β-cells to repopulate the pancreas.

[0165] For treatment of Type II diabetes, a transposon-based vector containing a proinsulin gene is specifically
incorporated into the pancreas by placing the transposase gene under the control of a pancreas-specific promoter, such as
an insulin promoter. In this embodiment, the vector is delivered to a diabetic animal or human via injection into an artery
feeding the pancreas. For delivery, the vector is complexed with a transfection agent. The artery distributes the complex
throughout the pancreas, where individual cells receive the vector DNA. Following uptake into the target cell, the insulin
promoter is recognized by transcriptional machinery of the cell, the transposase encoded by the vector is expressed,
and stable integration of the proinsulin gene occurs. It is expected that a small percentage of the transposon-based
vector is transported to other tissues, and that these tissues are transfected. However, these tissues are not stably
transfected and the proinsulin gene is not incorporated into the cells' DNA due to failure of these cells to activate the
insulin promoter. The vector DNA is likely lost when the cell dies or is degraded over time.

[0166] In other embodiments, one or more transposon-based vectors are administered to an avian for the treatment
of a viral or bacterial infection/disease including, but not limited to, Colibacillosis (Coliform infections), Mycoplasmosis
(CRD, Air sac, Sinusitis), Fowl Cholera, Necrotic Enteritis, Ulcerative Enteritis (Quail disease), Pullorum Disease, Fowl
Typhoid, Botulism, Infectious Coryza, Erysipelas, Avian Pox, Newcastle Disease, Infectious Bronchitis, Quail Bronchitis,
Lymphoid Leukosis, Marek's Disease (Visceral Leukosis), Infectious Bursal Disease (Gumboro). In these embodiments,
the transposon-based vectors may be used in a manner similar to traditional vaccines.

[0167] In still other embodiments, one or more transposon-based vectors are administered to an animal for the
production of an animal with enhanced growth characteristics and nutrient utilization.

[0168] The transposon-based vectors of the present invention can be used to transform any animal cell, including but
not limited to: cells producing hormones, cytokines, growth factors, or any other biologically active substance; cells of
the immune system; cells of the nervous system; muscle (striated, cardiac, smooth) cells; vascular system cells; endothelial
cells; skin cells; mammary cells; and lung cells, including bronchial and alveolar cells. Transformation of any endocrine
cell by a transposon-based vector is contemplated as a part of the present invention. In one aspect of the present invention,
cells of the immune system may be the target for incorporation of a desired gene or genes encoding for production of
antibodies. Accordingly, the thymus, bone marrow, beta lymphocytes (or B cells), gastrointestinal associated lymphatic
tissue (GALT), Peyer's patches, bursa of Fabricius, lymph nodes, spleen, and tonsil, and any other lymphatic tissue, may
all be targets for administration of the compositions of the present invention.

[0169] The transposon-based vectors of the present invention can be used to modulate (stimulate or inhibit) production
of any substance, including but not limited to a hormone, a cytokine, or a growth factor, by an animal or a human cell.
Modulation of a regulated signal within a cell or a tissue, such as production of a second messenger, is also contemplated
as a part of the present invention. Use of the transposon-based vectors of the present invention is contemplated for
treatment of any animal or human disease or condition that results from underproduction (such as diabetes) or
overproduction (such as hyperthyroidism) of a hormone or other endogenous biologically active substance. Use of the
transposon-based vectors of the present invention to integrate nucleotide sequences encoding RNA molecules, such as
antisense RNA or short interfering RNA, is also contemplated as a part of the present invention.

[0170] Additionally, the transposon-based vectors of the present invention may be used to provide cells or tissues
with "beacons", such as receptor molecules, for binding of therapeutic agents in order to provide tissue and cell specificity
for the therapeutic agents. Several promoters and exogenous genes can be combined in one vector to produce
progressive, controlled treatments from a single vector delivery.

[0171] The following examples will serve to further illustrate the present invention without, at the same time, however,
constituting any limitation thereof. On the contrary, it is to be clearly understood that resort may be had to various
embodiments, modifications and equivalents thereof which, after reading the description herein, may suggest themselves
to those skilled in the art without departing from the spirit of the invention.

EXAMPLE 1 

Preparation of Transposon-Based Vector pTnMod

[0172] A vector was designed for inserting a desired coding sequence into the genome of eukaryotic cells, given below
as SEQ ID NO:1. The vector of SEQ ID NO:1, termed pTnMod, was constructed and its sequence verified.

[0173] This vector employed a cytomegalovirus (CMV) promoter. A modified Kozak sequence (ACCATG) (SEQ ID
NO:13) was added to the promoter. The nucleotide in the wobble position in nucleotide triplet codons encoding the first
10 amino acids of transposase was changed to an adenine (A) or thymine (T), which did not alter the amino acid encoded
by this codon. Two stop codons were added and a synthetic polyA was used to provide a strong termination sequence.
This vector uses a promoter designed to be active soon after entering the cell (without any induction) to increase the
likelihood of stable integration. The additional stop codons and synthetic polyA ensure proper termination without read
through to potential genes downstream.

[0174] The first step in constructing this vector was to modify the transposase to have the desired changes.
Modifications to the transposase were accomplished with the primers High Efficiency forward primer (Hef) Altered transposase
(ATS)-Hef 5' ATCTCGAGACCATGTGTGAACTTGATATTTTACATGATTCTCTTTACC 3' (SEQ ID NO:10) and Altered
transposase-High efficiency reverse primer (Her) 5' GATTGATCATTATCATAATTTCCCCAAAGCGTAACC 3' (SEQ ID
NO:11, a reverse complement primer). In the 5' forward primer ATS-Hef, the sequence CTCGAG (SEQ ID NO:12) is
the recognition site for the restriction enzyme Xho I, which permits directional cloning of the amplified gene. The sequence
ACCATG (SEQ ID NO:13) contains the Kozak sequence and start codon for the transposase and the underlined bases
represent changes in the wobble position to an A or T of codons for the first 10 amino acids (without changing the amino
acid coded by the codon). Primer ATS-Her (SEQ ID NO:11) contains an additional stop codon TAA in addition to native
stop codon TGA and adds a Bcl I restriction site, TGATCA (SEQ ID NO:14), to allow directional cloning. These primers
were used in a PCR reaction with pTnLac (p defines plasmid, tn defines transposon and lac defines the beta fragment
of the lactose gene, which contains a multiple cloning site) as the template for the transposase and a FailSafe™ PCR
System (which includes enzyme, buffers, dNTPs, MgCl2 and PCR Enhancer; Epicentre Technologies, Madison, WI).
Amplified PCR product was electrophoresed on a 1% agarose gel, stained with ethidium bromide, and visualized on an
ultraviolet transilluminator. A band corresponding to the expected size was excised from the gel and purified from the
agarose using a Zymo Clean Gel Recover Kit (Zymo Research, Orange, CA). Purified DNA was digested with restriction
enzymes Xho I (5') and Bcl I (3') (New England Biolabs, Beverly, MA) according to the manufacturer's protocol. Digested
DNA was purified from restriction enzymes using a Zymo DNA Clean and Concentrator kit (Zymo Research).
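
The primer features described above can be checked with a short Python sketch. The two primer strings are the sequences of SEQ ID NO:10 and SEQ ID NO:11 as read from this text (the scanned copy is damaged, so individual bases are an assumption); the helper names are hypothetical. The sketch locates the Xho I site, the Kozak/start sequence and the Bcl I site, lists the wobble bases of the ten codons that follow the ATG start, and shows the two stop codons on the coding strand.

```python
# Primer sequences as read from the text (OCR-damaged source; treat as assumptions).
ATS_HEF = "ATCTCGAGACCATGTGTGAACTTGATATTTTACATGATTCTCTTTACC"  # SEQ ID NO:10
ATS_HER = "GATTGATCATTATCATAATTTCCCCAAAGCGTAACC"              # SEQ ID NO:11

XHO_I = "CTCGAG"        # SEQ ID NO:12
KOZAK_START = "ACCATG"  # SEQ ID NO:13
BCL_I = "TGATCA"        # SEQ ID NO:14

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def codons_after_start(primer: str, n: int) -> list:
    """Return the n codons that follow the first ATG in the primer."""
    first = primer.index("ATG") + 3
    return [primer[i:i + 3] for i in range(first, first + 3 * n, 3)]


# Forward primer: restriction site, then Kozak sequence with the start codon.
assert ATS_HEF.index(XHO_I) < ATS_HEF.index(KOZAK_START)

# Wobble (third) bases of the ten codons after ATG.
wobble_bases = [codon[2] for codon in codons_after_start(ATS_HEF, 10)]
print("wobble bases:", "".join(wobble_bases))

# Reverse primer read on the coding strand: native TGA, added TAA, then Bcl I.
coding_strand_end = reverse_complement(ATS_HER)
print("coding strand:", coding_strand_end)
```

In this reading every listed wobble base is A or T, and the coding strand of the reverse primer contains TGA followed by TAA and then the Bcl I site.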

[0175] Plasmid gWiz (Gene Therapy Systems, San Diego, CA) was digested with restriction enzymes Sal I and
BamH I (New England Biolabs), which are compatible with Xho I and Bcl I, but destroy the restriction sites. Digested
gWiz was separated on an agarose gel, the desired band excised and purified as described above. Cutting
in this manner facilitated directional cloning of the modified transposase (mATS) between the CMV promoter and synthetic
polyA.
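
Why these enzyme pairs are compatible, and why the original sites are destroyed, can be seen from their recognition sequences. The following Python sketch uses the standard recognition sites and cut positions for the four enzymes (Xho I C^TCGAG, Sal I G^TCGAC, Bcl I T^GATCA, BamH I G^GATCC); the function names are hypothetical and the sketch models only the top strand.

```python
# Recognition sequence (5'->3', top strand) and cut offset from the 5' end.
ENZYMES = {
    "Xho I": ("CTCGAG", 1),
    "Sal I": ("GTCGAC", 1),
    "Bcl I": ("TGATCA", 1),
    "BamH I": ("GGATCC", 1),
}


def overhang(enzyme: str) -> str:
    """Return the 5' single-stranded overhang left by a palindromic cutter."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def junction(left_enzyme: str, right_enzyme: str) -> str:
    """Sequence formed when the left half-site of one enzyme is ligated
    to the right half-site of another enzyme with a matching overhang."""
    left_site, left_cut = ENZYMES[left_enzyme]
    right_site, right_cut = ENZYMES[right_enzyme]
    return left_site[:left_cut] + right_site[right_cut:]


def recognized_by(seq: str) -> list:
    """List the enzymes in ENZYMES whose site occurs in seq."""
    return [name for name, (site, _) in ENZYMES.items() if site in seq]


# Compatible pairs share the same overhang.
print(overhang("Xho I"), overhang("Sal I"))    # TCGA TCGA
print(overhang("Bcl I"), overhang("BamH I"))   # GATC GATC

# Junctions formed in the cloning step described above.
five_prime = junction("Sal I", "Xho I")     # vector Sal I end + insert Xho I end
three_prime = junction("Bcl I", "BamH I")   # insert Bcl I end + vector BamH I end
print(five_prime, recognized_by(five_prime))     # GTCGAG []
print(three_prime, recognized_by(three_prime))   # TGATCC []
```

Because the 5' and 3' overhangs differ (TCGA versus GATC), the insert can ligate in only one orientation, which is what makes the cloning directional.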

[0176] To insert the mATS between the CMV promoter and synthetic polyA in gWiz, a Stratagene T4 Ligase Kit
(Stratagene, Inc., La Jolla, CA) was used and the ligation set up according to the manufacturer's protocol. Ligated product
was transformed into E. coli Top10 competent cells (Invitrogen Life Technologies, Carlsbad, CA) using chemical
transformation according to Invitrogen's protocol. Transformed bacteria were incubated in 1 ml of SOC (GIBCO BRL, CAT#
15544-042) medium for 1 hour at 37° C before being spread to LB (Luria-Bertani media (broth or agar)) plates
supplemented with 100 µg/ml ampicillin (LB/amp plates). These plates were incubated overnight at 37° C and resulting colonies
picked to LB/amp broth for overnight growth at 37° C. Plasmid DNA was isolated using a modified alkaline lysis protocol
(Sambrook et al., 1989), electrophoresed on a 1% agarose gel, and visualized on a U.V. transilluminator after ethidium
bromide staining. Colonies producing a plasmid of the expected size (approximately 6.4 kbp) were cultured in at least
250 ml of LB/amp broth and plasmid DNA harvested using a Qiagen Maxi-Prep Kit (column purification) according to
the manufacturer's protocol (Qiagen, Inc., Chatsworth, CA). Column purified DNA was used as template for sequencing
to verify the changes made in the transposase were the desired changes and no further changes or mutations occurred
due to PCR amplification. For sequencing, Perkin-Elmer's Big Dye Sequencing Kit was used. All samples were sent to
the Gene Probes and Expression Laboratory (LSU School of Veterinary Medicine) for sequencing on a Perkin-Elmer
Model 377 Automated Sequencer.

[0177] Once a clone was identified that contained the desired mATS in the correct orientation, primers CMVf-NgoM
IV (5' TTGCCGGCATCAGATTGGCTAT (SEQ ID NO:15); underlined bases denote NgoM IV recognition site) and
Syn-polyA-BstE II (5' AGAGGTCACCGGGTCAATTGTTGAGCACCTGGTA (SEQ ID NO:16); underlined bases denote BstE
II recognition site) were used to PCR amplify the entire CMV promoter, mATS, and synthetic polyA for cloning upstream
of the transposon in pTnLac. The PCR was conducted with FailSafe™ as described above, purified using the Zymo
Clean and Concentrator kit, the ends digested with NgoM IV and BstE II (New England Biolabs), purified with the Zymo
kit again and cloned upstream of the transposon in pTnLac as described below.

[0178] Plasmid pTnLac was digested with NgoM IV and BstE II to remove the ptac promoter and transposase and the
fragments separated on an agarose gel. The band corresponding to the vector and transposon was excised, purified
from the agarose, and dephosphorylated with calf intestinal alkaline phosphatase (New England Biolabs) to prevent
self-annealing. The enzyme was removed from the vector using a Zymo DNA Clean and Concentrator-5. The purified
vector and CMVp/mATS/polyA were ligated together using a Stratagene T4 Ligase Kit and transformed into E. coli as
described above.

[0179] Colonies resulting from this transformation were screened (mini-preps) as described above and clones that were
the correct size were verified by DNA sequence analysis as described above. The vector was given the name pTnMod
(SEQ ID NO:1) and includes the following components:

Base pairs 1-130 are a remainder of F1(-) ori from pBluescriptII sk(-) (Stratagene), corresponding to base pairs
1-130 of pBluescriptII sk(-).
Base pairs 131 - 132 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 133 - 1777 are the CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems),
corresponding to bp 229-1873 of pGWiz. The CMV promoter was modified by the addition of an ACC sequence upstream
of ATG.
Base pairs 1778-1779 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 1780 - 2987 are the coding sequence for the transposase, modified from Tn10 (GenBank accession
J01829) by optimizing codons for stability of the transposase mRNA and for the expression of protein. More
specifically, in each of the codons for the first ten amino acids of the transposase, G or C was changed to A or T when
such a substitution would not alter the amino acid that was encoded.
Base pairs 2988-2993 are two engineered stop codons.
Base pair 2994 is a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 2995 - 3410 are a synthetic polyA sequence taken from the pGWiz vector (Gene Therapy Systems),
corresponding to bp 1922-2337 of pGWiz.
Base pairs 3415 - 3718 are non-coding DNA that is residual from vector pNK2859.
Base pairs 3719 - 3761 are non-coding λ DNA that is residual from pNK2859.
Base pairs 3762 - 3831 are the 70 bp of the left insertion sequence recognized by the transposon Tn10.
Base pairs 3832-3837 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 3838 - 4527 are the multiple cloning site from pBluescriptII sk(-), corresponding to bp 924-235 of
pBluescriptII sk(-). This multiple cloning site may be used to insert any coding sequence of interest into the vector.
Base pairs 4528-4532 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 4533 - 4602 are the 70 bp of the right insertion sequence recognized by the transposon Tn10.
Base pairs 4603 - 4644 are non-coding λ DNA that is residual from pNK2859.
Base pairs 4645 - 5488 are non-coding DNA that is residual from pNK2859.
Base pairs 5489 - 7689 are from the pBluescriptII sk(-) base vector (Stratagene, Inc.), corresponding to bp 761-2961
of pBluescriptII sk(-).

Completing pTnMod is a pBlueScript backbone that contains a ColE1 origin of replication and an antibiotic resistance
marker (ampicillin).

It should be noted that all non-coding DNA sequences described above can be replaced with any other non-coding
DNA sequence(s). Missing nucleotide sequences in the above construct represent restriction site remnants.

All plasmid DNA was isolated by standard procedures. Briefly, Escherichia coli containing the plasmid was grown
in 500 mL aliquots of LB broth (supplemented with an appropriate antibiotic) at 37°C overnight with shaking. Plasmid
DNA was recovered from the bacteria using a Qiagen Maxi-Prep kit (Qiagen, Inc., Chatsworth, CA) according to
the manufacturer's protocol. Plasmid DNA was resuspended in 500 µL of PCR-grade water and stored at -20°C
until used.
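
A component list of this kind can be checked mechanically for gaps and overlaps. The Python sketch below encodes the pTnMod coordinates as read from this text (1-based, inclusive); because the scanned source is damaged, digits such as 1779, 4602 and 4603 are assumed readings chosen so that neighbouring ranges abut. The labels and function name are hypothetical shorthand.

```python
# (start, end, label) for pTnMod, 1-based inclusive; coordinates are ASSUMED
# readings of an OCR-damaged list, not independently verified.
PTNMOD_FEATURES = [
    (1, 130, "F1(-) ori remainder"),
    (131, 132, "ligation residue"),
    (133, 1777, "CMV promoter/enhancer"),
    (1778, 1779, "ligation residue"),
    (1780, 2987, "transposase coding sequence"),
    (2988, 2993, "two engineered stop codons"),
    (2994, 2994, "ligation residue"),
    (2995, 3410, "synthetic polyA"),
    (3415, 3718, "non-coding DNA (pNK2859)"),
    (3719, 3761, "non-coding lambda DNA (pNK2859)"),
    (3762, 3831, "left insertion sequence"),
    (3832, 3837, "ligation residue"),
    (3838, 4527, "multiple cloning site"),
    (4528, 4532, "ligation residue"),
    (4533, 4602, "right insertion sequence"),
    (4603, 4644, "non-coding lambda DNA (pNK2859)"),
    (4645, 5488, "non-coding DNA (pNK2859)"),
    (5489, 7689, "pBluescriptII sk(-) backbone"),
]


def find_gaps_and_overlaps(features):
    """Return (gaps, overlaps) between consecutive features.

    gaps is a list of (first_missing_bp, last_missing_bp);
    overlaps is a list of (label_a, label_b) pairs that share base pairs.
    """
    gaps, overlaps = [], []
    ordered = sorted(features)
    for (_, end_a, label_a), (start_b, _, label_b) in zip(ordered, ordered[1:]):
        if start_b > end_a + 1:
            gaps.append((end_a + 1, start_b - 1))
        elif start_b <= end_a:
            overlaps.append((label_a, label_b))
    return gaps, overlaps


gaps, overlaps = find_gaps_and_overlaps(PTNMOD_FEATURES)
insertion_lengths = [end - start + 1
                     for start, end, label in PTNMOD_FEATURES
                     if "insertion sequence" in label]
print("gaps:", gaps)
print("overlaps:", overlaps)
print("insertion sequence lengths:", insertion_lengths)
```

With these readings the only unlisted stretch is bp 3411-3414, which fits the statement that missing nucleotides represent restriction site remnants, and both insertion sequences come out at 70 bp.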



EXAMPLE 2 



Preparation of Transposon-Based Vector pTnMod (CMV/Red)

[0180] A vector was designed for inserting a reporter gene (DsRed) under the control of the CMV promoter into the
genome of vertebrate cells, given below as SEQ ID NO:2. The reporter gene chosen was the DsRed gene, driven by the
immediate early cytomegalovirus promoter, to produce a plasmid called pTnCMV/DsRed. The DsRed gene product is
a red fluorescent protein from an Indo-Pacific sea anemone, Discosoma sp., which fluoresces bright red at 558 nm. It is
to be understood that the reporter gene, i.e., the DsRed gene, is only one embodiment of the present invention and that
any gene of interest may be inserted into the plasmid in place of the DsRed reporter gene in any Experiment described
herein.

[0181] The vector of SEQ ID NO:2, named pTnMod (CMV/Red), was constructed, and its sequence verified by
re-sequencing. SEQ ID NO:2, pTnMod (CMV/Red), includes the following components:

Base pairs 1-130 are a remainder of F1(-) ori from pBluescriptII sk(-) (Stratagene), corresponding to bp 1-130 of
pBluescriptII sk(-).
Base pairs 131 - 132 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 133 - 1777 are the CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems),
corresponding to bp 229-1873 of pGWiz.
Base pairs 1778 - 1779 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 1780 - 2987 are the coding sequence for the transposase, modified from Tn10 (GenBank accession
J01829) by optimizing codons as discussed above.
Base pairs 2988-2993 are two engineered stop codons.
Base pair 2994 is a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 2995 - 3410 are a synthetic polyA sequence taken from the pGWiz vector (Gene Therapy Systems),
corresponding to bp 1922-2337 of pGWiz.
Base pairs 3415 - 3718 are non-coding DNA that is residual from vector pNK2859.
Base pairs 3719 - 3761 are non-coding λ DNA that is residual from pNK2859.
Base pairs 3762 - 3831 are the 70 bp of the left insertion sequence recognized by the transposon Tn10.
Base pairs 3832-3837 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 3838 - 4044 are part of the multiple cloning site from pBluescriptII sk(-), corresponding to bp 924-718 of
pBluescriptII sk(-).
Base pairs 4045-4048 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 4049-5693 are the CMV promoter/enhancer, taken from vector pGWiz (Gene Therapy Systems),
corresponding to bp 229-1873 of pGWiz.
Base pairs 5694-5701 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 5702 - 6617 are the DsRed reporter coding sequence, including polyA sequence, from pDsRed1.1
(Clontech), corresponding to bp 77 - 992 of pDsRed1.1.
Base pairs 6618-7101 are part of the multiple cloning site from pBluescriptII sk(-), corresponding to bp 718-235 of
pBluescriptII sk(-).
Base pairs 7102-7106 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 7107 - 7176 are the 70 bp of the right insertion sequence recognized by the transposon Tn10.
Base pairs 7177 - 7218 are non-coding λ DNA that is residual from pNK2859.
Base pairs 7219 - 8082 are non-coding DNA that is residual from pNK2859.
Base pairs 8083 - 10283 are from the pBluescriptII sk(-) base vector (Stratagene, Inc.), corresponding to bp 761-2961
of pBluescriptII sk(-).

It should be noted that all non-coding DNA sequences described above can be replaced with any other non-coding
DNA sequence(s).
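
Each borrowed segment in a list like this has two coordinate ranges, one in the vector and one in the source plasmid, and the two should span the same number of base pairs. The Python sketch below runs that check for the pTnMod (CMV/Red) segments. The coordinates are readings of an OCR-damaged list (values such as 6617, 6618, 8083 and 2961 are assumptions chosen so that ranges abut and lengths agree), and the labels are hypothetical shorthand.

```python
# (label, (vector_start, vector_end), (source_start, source_end)).
# Source ranges may run in reverse when a segment is inserted in inverted order.
# All coordinates are ASSUMED readings of a damaged scan.
SEGMENTS = [
    ("CMV promoter/enhancer (transposase)", (133, 1777), (229, 1873)),
    ("synthetic polyA", (2995, 3410), (1922, 2337)),
    ("multiple cloning site, part 1", (3838, 4044), (924, 718)),
    ("CMV promoter/enhancer (DsRed)", (4049, 5693), (229, 1873)),
    ("DsRed coding sequence with polyA", (5702, 6617), (77, 992)),
    ("multiple cloning site, part 2", (6618, 7101), (718, 235)),
    ("pBluescriptII sk(-) backbone", (8083, 10283), (761, 2961)),
]


def span(first: int, last: int) -> int:
    """Number of base pairs in an inclusive range, in either orientation."""
    return abs(last - first) + 1


def length_mismatches(segments):
    """Return labels whose vector span and source span differ in length."""
    return [label for label, vector, source in segments
            if span(*vector) != span(*source)]


mismatches = length_mismatches(SEGMENTS)
for label, vector, source in SEGMENTS:
    print(f"{label}: {span(*vector)} bp in vector, {span(*source)} bp in source")
print("mismatches:", mismatches)
```

A mismatch reported by such a check points either to a transcription error in one of the two ranges or to a segment that was trimmed during cloning.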

EXAMPLE 3 

Preparation of Transposon-Based Vector pTnMod (Oval/Red) - Chicken

[0182] A vector was designed for inserting a reporter gene (DsRed) under the control of the ovalbumin promoter, and
including the ovalbumin signal sequence, into the genome of a bird. One version of this vector is given below as SEQ
ID NO:3. The vector of SEQ ID NO:3, named pTnMod (Oval/Red) - Chicken, includes chicken ovalbumin promoter and
signal sequences.

[0183] SEQ ID NO:3, pTnMod (Oval/Red) - Chicken, includes the following components:

Base pairs 1-130 are a remainder of F1(-) ori from pBluescriptII sk(-) (Stratagene), corresponding to bp 1-130 of
pBluescriptII sk(-).
Base pairs 131-132 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 133 - 1777 are the CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems),
corresponding to bp 229-1873 of pGWiz.
Base pairs 1778-1779 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 1780 - 2987 are the coding sequence for the transposase, modified from Tn10 (GenBank accession
J01829) by optimizing codons as discussed above.
Base pairs 2988-2993 are two engineered stop codons.
Base pair 2994 is a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 2995 - 3410 are a synthetic polyA sequence taken from the pGWiz vector (Gene Therapy Systems),
corresponding to bp 1922-2337 of pGWiz.
Base pairs 3415 - 3718 are non-coding DNA that is residual from vector pNK2859.
Base pairs 3719 - 3761 are non-coding λ DNA that is residual from pNK2859.
Base pairs 3762 - 3831 are the 70 bp of the left insertion sequence recognized by the transposon Tn10.
Base pairs 3832-3837 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 3838 - 4044 are part of the multiple cloning site from pBluescriptII sk(-), corresponding to bp 924-718 of
pBluescriptII sk(-).
Base pairs 4045-4049 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 4050 - 4951 contain upstream elements of the ovalbumin promoter (including SDRE, steroid-dependent
response element). See GenBank accession number J00895 M24999, bp 431-1332.
Base pairs 4952-4959 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 4960 - 5112 are the chicken ovalbumin signal sequence (GenBank accession number J00895 M24999,
bp 2996-3148).
Base pairs 5113-5118 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 5119-6011 are the DsRed reporter coding sequence, including polyA sequence, from pDsRed1.1
(Clontech), corresponding to bp 100 - 992 of pDsRed1.1.
Base pairs 6012-6017 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 6018 - 6056 are part of the multiple cloning site of the ZeroBlunt Topo cloning vector (Invitrogen),
corresponding to bp 337-377 of ZeroBlunt.
Base pairs 6057-6062 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 6063 - 6495 are part of the multiple cloning site from pBluescriptII sk(-), corresponding to bp 667-235 of
pBluescriptII sk(-).
Base pairs 6496-6500 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 6501 - 6570 are the 70 bp of the right insertion sequence recognized by the transposon Tn10.
Base pairs 6571 - 6612 are non-coding λ DNA that is residual from pNK2859.
Base pairs 6613 - 7477 are non-coding DNA that is residual from pNK2859.
Base pairs 7478 - 9678 are from the pBluescriptII sk(-) base vector (Stratagene, Inc.), corresponding to bp 761-2961
of pBluescriptII sk(-).

It should be noted that all non-coding DNA sequences described above can be replaced with any other non-coding
DNA sequence(s).

EXAMPLE 4 

Preparation of Transposon-Based Vector pTnMod(Oval/Red) - Quail

[0184] A vector was designed for inserting a reporter gene (DsRed) under the control of the ovalbumin promoter, and
including the ovalbumin signal sequence, into the genome of a bird, given below as SEQ ID NO:4. The vector of SEQ
ID NO:4, named pTnMod (Oval/Red) - Quail, has been constructed, and selected portions of the sequence have been
verified by re-sequencing.

[0185] SEQ ID NO:4, pTnMod (Oval/Red) - Quail, includes the following components:

Base pairs 1-130 are a remainder of F1(-) ori from pBluescriptII sk(-) (Stratagene), corresponding to bp 1-130 of
pBluescriptII sk(-).

Base pairs 131 - 132 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 133 - 1777 are the CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems),
corresponding to bp 229-1873 of pGWiz.

Base pairs 1778-1779 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 1780 - 2987 are the coding sequence for the transposase, modified from Tn10 (GenBank accession
J01829) by optimizing codons as discussed above.

Base pairs 2988-2993 are two engineered stop codons. Base pair 2994 is a residue from ligation of restriction 
enzyme sites used in constructing the vector. 

Base pairs 2995 - 3410 are a synthetic polyA sequence taken from the pGWiz vector (Gene Therapy Systems),
corresponding to bp 1922-2337 of pGWiz.

Base pairs 3415 - 3718 are non-coding DNA that is residual from vector pNK2859.
Base pairs 3719 - 3761 are non-coding λ DNA that is residual from pNK2859.

Base pairs 3762 - 3831 are the 70 base pairs of the left insertion sequence recognized by the transposon Tn10.
Base pairs 3832-3837 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 3838 - 4044 are part of the multiple cloning site from pBluescriptII sk(-), corresponding to bp 924-718 of
pBluescriptII sk(-).

Base pairs 4045-4049 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 4050 - 4934 are the Japanese quail ovalbumin promoter (including SDRE, steroid-dependent response
element). The Japanese quail ovalbumin promoter was isolated by its high degree of homology to the chicken
ovalbumin promoter (GenBank accession number J00895 M24999, base pairs 431-1332). Some deletions were
noted in the quail sequence, as compared to the chicken sequence.
Base pairs 4935-4942 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 4943 - 5092 are the Japanese quail ovalbumin signal sequence. The quail signal sequence was isolated
by its high degree of homology to the chicken signal sequence (GenBank accession number J00895 M24999, base
pairs 2996-3148). Some deletions were noted in the quail sequence, as compared to the chicken sequence.
Base pairs 5093-5098 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 5099 - 5991 are the DsRed reporter coding sequence, including polyA sequence, from pDsRed1.1
(Clontech), corresponding to bp 100 - 992 of pDsRed1.1.

Base pairs 5992-5997 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 5998 - 6036 are part of the multiple cloning site of the ZeroBlunt Topo cloning vector (Invitrogen),
corresponding to base pairs 337-377 of ZeroBlunt.

Base pairs 6037-6042 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 6043 - 6475 are part of the multiple cloning site from pBluescriptII sk(-), corresponding to bp 667-235 of
pBluescriptII sk(-).

Base pairs 6476-6480 are a residue from ligation of restriction enzyme sites used in constructing the vector.
Base pairs 6481 - 6550 are the 70 bp of the right insertion sequence recognized by the transposon Tn10.
Base pairs 6551 - 6592 are non-coding λ DNA that is residual from pNK2859.
Base pairs 6593 - 7457 are non-coding DNA that is residual from pNK2859.

Base pairs 7458 - 9658 are from the pBluescriptII sk(-) base vector (Stratagene, Inc.), corresponding to base pairs
761-2961 of pBluescriptII sk(-).

It should be noted that all non-coding DNA sequences described above can be replaced with any other non-coding
DNA sequence(s).
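
As a consistency check on a component list of this kind, the length of each vector span can be compared with the length of the source span it is said to correspond to. The Python sketch below does this for several spans listed for SEQ ID NO:4. The helper names are hypothetical and are not part of the disclosure; only the coordinates are taken from the list above.

```python
# Illustrative sketch only: helper names are hypothetical; the coordinates are
# copied from the component list for SEQ ID NO:4.

def span_length(start, end):
    """Inclusive length of a base-pair span (also works for reversed spans)."""
    return abs(end - start) + 1

# (description, span in the vector, span in the source sequence)
CORRESPONDENCES = [
    ("CMV promoter/enhancer vs. pGWiz", (133, 1777), (229, 1873)),
    ("synthetic polyA vs. pGWiz", (2995, 3410), (1922, 2337)),
    ("multiple cloning site vs. pBluescriptII sk(-)", (3838, 4044), (924, 718)),
    ("DsRed reporter vs. pDsRed1.1", (5099, 5991), (100, 992)),
    ("multiple cloning site vs. pBluescriptII sk(-)", (6043, 6475), (667, 235)),
    ("base vector vs. pBluescriptII sk(-)", (7458, 9658), (761, 2961)),
]

def check_correspondences(rows):
    """Return (description, vector length, source length, lengths match)."""
    report = []
    for description, vector_span, source_span in rows:
        vector_len = span_length(*vector_span)
        source_len = span_length(*source_span)
        report.append((description, vector_len, source_len, vector_len == source_len))
    return report

if __name__ == "__main__":
    for description, vector_len, source_len, ok in check_correspondences(CORRESPONDENCES):
        print(f"{description}: {vector_len} bp vs {source_len} bp, match={ok}")
```

A reversed source span such as 924-718 simply indicates that the fragment is present in the opposite orientation; the inclusive length is unaffected.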

EXAMPLE 5 

Transfection of Stage X Japanese Quail Eggs with pTnMod(Oval/Red) - Quail via Embryo Injection

[0186] Transgenic Japanese quail were produced by transfecting Stage X embryos, and the heritability of the transgene
delivered by embryo transfection was established. More specifically, fertile eggs were collected in the morning and
placed at 15° C until enough were collected for injection, but were held no longer than 7 days. Stage X embryos (eggs)
were assigned to one of two treatment groups. Before treatment, each egg was incubated on its side at room temperature
for about 2 hours to allow the embryo to move to "top dead center" (TDC). Each egg was transfected by drilling a 1 mm
hole (directly above the embryo) through the shell without penetrating the underlying shell membrane. A 0.5 ml syringe
fitted with a 28 gauge needle was used to deliver DNA complexed to a transfecting reagent, i.e. SUPERFECT®, in a 50 μl
volume. An adhesive disc was used to seal the hole and provide a label for treatment identification. After all eggs
were transfected, they were set in an incubator with the adhesive disc pointing upward for hatching.
[0187] Each bird that hatched was bled at one week of age, DNA was extracted from blood cells, and PCR was
conducted using 28S primers as a positive control and primers specific to DsRed. Any bird that was negative was
terminated, while positive birds were monitored to determine maintenance of the transgene. Birds consistently positive
were maintained until sexual maturity and bred. Positive male and female birds were mated. The eggs of mated hens
were hatched and the resulting chicks, the G1 generation, were evaluated to determine if they were transgenic. All G1s
resulting from this mating were bled and PCR conducted as described above.

[0188] Egg injection: Two treatment groups and one control group were used for this experiment. Vector pTnMod
(Oval/Red) in supercoiled form (Treatment 1) and in linear form (Treatment 2) were used to transfect 15 eggs per
treatment. To obtain linear DNA for this experiment, pTnMod (Oval/Red) was digested with NgoM IV, column purified,
and resuspended in TE buffer.

[0189] Each egg was injected with 0.75 μg of DNA complexed with SUPERFECT® in a 1:3 ratio in a total injection
volume of 50 μl. Hank's Balanced Salt Solution (HBSS) was used to bring the volume to 50 μl. The DNA-SUPERFECT®
mixture must be allowed to incubate (for complex formation) at room temperature for 10 minutes prior to injection and
must be used within 40 minutes post initial mixing. Eggs were incubated as described above after injection.
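
The arithmetic behind a per-egg injection mix of this kind can be sketched as follows. This is a minimal illustration, not the protocol itself: it assumes the DNA mass is in micrograms, reads the 1:3 ratio as micrograms of DNA to microliters of reagent, and uses a hypothetical DNA stock concentration of 0.5 μg/μl, none of which is stated in the text. The function names are likewise illustrative.

```python
# Illustrative sketch under stated assumptions (see the accompanying text):
# - the 1:3 ratio is read as ug DNA : ul transfection reagent
# - the DNA stock concentration is a hypothetical value

def injection_mix(dna_ug, dna_stock_ug_per_ul, reagent_ul_per_ug=3.0, total_ul=50.0):
    """Return the volumes (ul) of DNA stock, reagent, and HBSS for one egg."""
    dna_ul = dna_ug / dna_stock_ug_per_ul
    reagent_ul = dna_ug * reagent_ul_per_ug
    hbss_ul = total_ul - dna_ul - reagent_ul
    if hbss_ul < 0:
        raise ValueError("DNA and reagent volumes exceed the total injection volume")
    return {"dna_ul": dna_ul, "reagent_ul": reagent_ul, "hbss_ul": hbss_ul}

def mixture_usable(minutes_since_mixing, min_incubation=10, max_age=40):
    """True if the mixture has incubated at least 10 min and is at most 40 min old."""
    return min_incubation <= minutes_since_mixing <= max_age

if __name__ == "__main__":
    print(injection_mix(dna_ug=0.75, dna_stock_ug_per_ul=0.5))
    print(mixture_usable(25))
```

With a different stock concentration or a different reading of the ratio, only the arguments change; the HBSS volume is always whatever remains of the 50 μl total.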
[0190] Results: In the supercoiled injection group, 2 females and 1 male were identified as PCR positive using primers
specific to the DsRed coding sequence. These birds were mated as described above. Blood was taken from the G1
chicks and PCR was conducted. The results showed that the transgene was incorporated into the gametes of these
birds. The G1 chicks from these birds were examined on a weekly basis until it was verified that the gene was not present
or enough transgenic G1s were obtained to initiate a breeding flock of fully transgenic birds. Eggs from these G1 chicks
expressed DsRed protein in the albumin portion of their eggs.

EXAMPLE 6

Intratesticular Injection of Chickens with pTnMod(CMV/Red) (SEQ ID NO:2)

[0191] Immature birds of different ages (4, 6, 8, 10, 12, and 14 weeks) were placed under anesthesia and injected in
the testes with the construct pTnMod(CMV/Red). A saline solution containing 1-5 μg of purified DNA vector was mixed
with SUPERFECT® transfecting reagent (Qiagen, Valencia, CA) in a 1:8 (wt:vol) ratio. The volume of saline was adjusted
so that the total volume injected into each testis was 150-200 μl, depending on the age and size of the bird. For the 4-
and 6-week-old chickens, 1 μg DNA in 150 μl was injected in each testis, divided into three doses of 50 μl each. For
the older birds, 200 μl total volume was injected, containing either 3 μg DNA (for 8-week-old birds) or 5 μg DNA (for
older birds) per testis. First, one testis was surgically exposed prior to injection. After injection, the incision was sutured,
and the sequence was repeated for the alternate testis.

[0192] From six to nine months post-surgery, weekly sperm samples were taken from each injected bird, as well as
from control birds. Each sperm sample was evaluated for uptake and expression of the injected gene. Samples were
evaluated by PCR on whole sperm, within one week after collection.

[0193] Approximately 100 male white leghorn chickens, in groups of 5-28, at ages 4, 6, 8, 10, 12, and 14 weeks, were
used, as this is the age range in which it is expected that the testes are likely to be most "receptive." In this age range,
the blood/testis barrier has not yet formed, and there is a relatively high number of spermatogonia relative to the numbers
of other cell types, e.g., spermatids, etc. See J. Kumaran et al., 1949, Poultry Sci., vol. 29, pp. 511-520. See also E.
Oakberg, 1958, Am. J. Anatomy, vol. 99, pp. 507-515; and P. Kluin et al., 1984, Anat. Embryol., vol. 169, pp. 73-78.
[0194] The experimental and control males were obtained from commercial sources at one day of age, and maintained
in brooders until used. The male birds were housed in temperature-controlled spaces in individual standard caging as
they approached maturity. They were given water and standard commercial feed ad lib. They were kept initially in a 23:1
hour light/dark cycle, stepped down at approximately weekly intervals to a 15:8 hour light/dark cycle, as this regimen
has been reported to optimize sexual maturity and fertility.

[0195] At the appropriate ages, groups of individual males were starved overnight and then subjected to transgene
delivery by direct intratesticular injection of DNA by experienced animal surgeons. Each male was anesthetized with
isoflurane via a simplified gas machine.

[0196] Various devices and anesthesia machines have previously been described for administering isoflurane (and
other gaseous anesthetics) to birds. See Alsage et al., Poultry Sci., 50:1876-1878 (1971); Greenlees et al., Am. J. Vet.
Res., vol. 51, pp. 757-758 (1990). However, these prior techniques are somewhat cumbersome and complex to
implement. A novel and much simpler system to administer isoflurane (or other gaseous) anesthesia was developed due to
the deficiencies in the prior art, a system that we found worked well on all ages of chicks. A standard nose cone was
placed over the chick's head, similar to the system that has been used for decades to administer ether to mice. A plastic
tube approximately 3.5 cm in diameter and 12 cm long was fitted with cotton, into which was poured approximately 2
mL isoflurane (Abbott Laboratories, Chicago). The chick's head was placed partially into the cylinder, and was held in
place there intermittently throughout the surgery as required to maintain the proper plane of anesthesia, without
overdosing.

[0197] Each anesthetized bird was positioned on its side on an animal board with cords tractioning the wings and feet
to allow access to the testes area. The area was swabbed with 0.5% chlorhexidine, and a 2 cm dorsolateral incision
was made in the skin over the testis (similar to the procedure commonly used for caponization). A small-animal retractor
was used to spread the last two ribs, exposing the testis. The DNA solution was then mixed with SUPERFECT® (Qiagen)
according to the manufacturer's protocol, at approximately a 1:8 wt/vol ratio, to a final concentration of 0.01 - 0.05 μg/μl.
This resulted in 1 - 5 μg total DNA (in a 150-200 μl volume) being injected into each testis, spread over three injection
sites: one at each end of the testis, and one in the middle.

[0198] The injection device was a standard 25 gauge, 1/2 inch (1.27 cm) hypodermic needle, attached to a 50, 100,
or 200 μl syringe. Approximately 5 mm of the needle tip was bent at a 90 degree angle, to facilitate insertion into the
testes. Approximately 50 - 70 μl of the DNA-SUPERFECT® solution was injected into each of three sites per testis. The
multiple injections were calculated to suffuse the DNA throughout the whole testis, the idea being to promote contact
between DNA and spermatogonia as much as feasible. We estimated that our procedure resulted in the injection of
about 100,000 DNA molecules per spermatogonium. The construct used in these tests was a highly potent constitutive
modified CMV promoter, operatively linked to the DsRed gene as shown in SEQ ID NO:2.
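
An estimate of DNA molecules per cell rests on converting a plasmid mass into a copy number. The sketch below shows that conversion only; it does not reproduce the inventors' calculation. It assumes an average mass of 650 g/mol per base pair of double-stranded DNA (a common approximation) and a hypothetical vector length of 10,000 bp, since the length of SEQ ID NO:2 is not given in this section. All names are illustrative.

```python
# Illustrative sketch: the 10,000 bp vector length is a placeholder assumption,
# and 650 g/mol per bp is a commonly used approximation for dsDNA.

AVOGADRO = 6.02214076e23        # molecules per mole
GRAMS_PER_MOL_PER_BP = 650.0    # approximate average for double-stranded DNA

def plasmid_copies(mass_ug, length_bp):
    """Approximate number of plasmid molecules in a given mass of DNA."""
    moles = (mass_ug * 1e-6) / (length_bp * GRAMS_PER_MOL_PER_BP)
    return moles * AVOGADRO

def cells_at_ratio(mass_ug, length_bp, copies_per_cell=100_000):
    """Number of cells that would each receive copies_per_cell molecules."""
    return plasmid_copies(mass_ug, length_bp) / copies_per_cell

if __name__ == "__main__":
    copies = plasmid_copies(mass_ug=5.0, length_bp=10_000)
    print(f"{copies:.3e} molecules in 5 ug of a 10 kb plasmid")
    print(f"{cells_at_ratio(5.0, 10_000):.3e} cells at 100,000 molecules per cell")
```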

[0199] Following injection, the incision was closed in two layers with 4-0 absorbable suture, and then the contralateral
testis was similarly exposed and injected. Following surgery, each bird was returned to its cage to recover. One hundred
thirteen males were ultimately used in the experimental regimen to increase the overall likelihood of success, along with
4 control birds (16 weeks old) subjected to sham surgery (with injections containing only the transfection reagent).

Evaluation of Birds

[0200] Thus, a total of 113 white leghorn chickens were injected with the DNA vector in groups of 5-28 at varying
ages. Fourteen birds were transformed at 4 weeks; 23 birds at 6 weeks; 28 birds at 8 weeks; 23 birds at 10 weeks; 5
birds at 12 weeks; and 22 birds at 14 weeks. Sixteen birds died before they could be sampled, so to date, 97 roosters
have been sampled, plus the four controls. Birds were evaluated at 18-24 weeks of age for (a) potential transformation
in the sperm, and (b) successful testis transfection. Sperm samples were obtained from each rooster by manual
manipulation using standard techniques. The sperm were washed, and their DNA was extracted following the techniques of
G. Mann et al., 1993, J. Reprod. Fert., 99:505-12. The samples were then frozen until analyzed. Evaluation was conducted
by PCR analysis to detect DNA integration into the sperm, or into any of the testicular cells. Additionally, selected testes
were harvested at the end of the sperm sampling period.

[0201] Of 97 birds tested, at least 22 showed probable positive results. Positive results were observed at all
transformation ages, except for 4 weeks, which was not tested. At least two birds were confirmed positive by PCR of sperm,
conducted four months after the initial injection. These results were transient in many cases, however, since it was
believed that the DsRed gene product used in these initial proof of concept experiments was toxic. Nevertheless, the
positive PCR results presumptively demonstrated that the transgene was incorporated into spermatogonia (before
puberty), and that it was carried in transgenic sperm. Such sperm could then transmit the gene to subsequent generations,
resulting in the production of true, germ-line transgenic "founder" birds.

[0202] To further confirm that the DNA had been incorporated into the sperm, and that contaminating vector was not
being detected from other sources, it was confirmed through PCR on sperm of experimental birds, and on positive and
negative controls, that the sperm of the experimental birds lacked DNA encoding the transposase. The design of the
preferred transposon-based vector is such that the sequence encoding the transposase is contained in the vector but
is not incorporated into the transformed chromosome. Thus, presence of the exogenous coding sequence, coupled with
absence of the transposase gene, is strong evidence for incorporation of the exogenous coding sequence, or transgene.
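
The inference in the preceding paragraph can be written as a small decision rule over two PCR outcomes. The sketch below is only an illustration of that reasoning; the function name and the wording of the returned labels are hypothetical and do not come from the source.

```python
# Illustrative sketch: labels and names are hypothetical summaries of the
# reasoning in the text, not terms used by the inventors.

def interpret_sperm_pcr(transgene_positive, transposase_positive):
    """Classify a sperm sample from two PCR results (True means a band was detected)."""
    if transgene_positive and not transposase_positive:
        # Transgene present without the transposase gene: the pattern the text
        # treats as strong evidence of incorporation.
        return "consistent with transgene incorporation"
    if transgene_positive and transposase_positive:
        # Both detected: intact vector may be present as a contaminant.
        return "possible contaminating vector"
    return "transgene not detected"

if __name__ == "__main__":
    print(interpret_sperm_pcr(True, False))
    print(interpret_sperm_pcr(True, True))
    print(interpret_sperm_pcr(False, False))
```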
[0203] These results demonstrated proof of concept, as positive PCR results were obtained from the sperm of treated
birds. Interpretation of these preliminary results was made more difficult by the fact that the modified CMV promoter
used in the experiment was probably too "hot." As the DsRed product is not secreted from the cells, the product built up
intracellularly to levels that were toxic, frequently killing the cells. Even this result, of course, means that the transformation
was successful. The transgene could not have killed the cells otherwise.

[0204] In order to resolve the problem with toxicity of the DsRed gene product, experiments were conducted using
a different reporter gene operably linked to the ovalbumin promoter, so that the transgene was expressed in the egg
white. These experiments are provided in Examples 12-15 below.

EXAMPLE 7 

Transfection of Male White Leghorn Chickens Using the Vector pTnMod(Oval/Red) - Quail (SEQ ID NO:4) via Testicular
Injections

[0205] In further experiments conducted on leghorn chickens, it was demonstrated that chickens injected
intratesticularly at 8, 10, 12, or 14 weeks of age had, on average, approximately 40% positive sperm between 6 and 8 months
after injection. In other experiments, successful transfection was achieved with chickens injected at 13 weeks of age.
[0206] Forty-nine white leghorn roosters approximately 8, 10, 12, or 14 weeks of age were obtained and housed. Birds
were identified, wing banded, and assigned to a treatment group. If appropriate (based on testes size and vascularization),
one testis was caponized and the entire DNA injection volume was delivered to the remaining testis. Thirty-two males
received DNA injections of 5 μg DNA/testis at a 1:3 ratio of DNA to SUPERFECT®. The remaining birds were used as
controls. After injection, all birds were mated with at least 5 females and observed until sexual maturity and egg-laying
began. All eggs collected prior to peak egg production (approximately 24 weeks of age for the hens) were incubated
and candled to determine embryo presence. Any embryos identified were incubated to hatch to extract DNA, PCR was
conducted, and transgene presence was determined.

[0207] Roosters positive for the pTnMod(Oval/Red) - Quail construct were kept to produce F1 offspring (eggs collected
at peak production). Offspring from this hatch were bled, DNA extracted from the blood, and PCR conducted using
primers specific for the DsRed gene. It was determined that 77% of the offspring were transgenic.

EXAMPLE 8 



Transfection of Mature Male Japanese Quail Using the Vector pTnMod(Oval/Red) - Quail (SEQ ID NO:4) via Testicular
Injections

[0208] Twelve sexually mature males (at approximately 13 weeks of age) underwent surgery for testicular injection
as described above for chickens. At 21-28 days of age, the birds were identified, leg banded, debeaked, and separated
based on sex. Injections comprised 5 μg/testis of the vector in concentrations 1:3 or 1:10 for SUPERFECT® or a 1:1
ratio with Mirus. The study consisted of 3 treatment groups with 5 males in the 1:3 DNA:SUPERFECT® group, 3 males
in the 1:10 DNA:SUPERFECT® group, and 4 males in the 1:1 Mirus group. All surgeries were conducted in one day.
[0209] Any unincorporated DNA was allowed to clear from the testes by holding the birds for 13 days before mating
with females. At 15 weeks of age, 2 age-matched females were housed with each treated male. The presence of the
transfected DNA was determined in the fertilized eggs during the second week of egg lay. Subsequent eggs from
parents producing positively identified transgenic eggs were collected and stored until taken to hatch.
[0210] PCR performed on the sperm of quail injected at three months of age indicated successful incorporation of the
DsRed transgene into the quail sperm.

EXAMPLE 9 

Transfection of Immature Male Japanese Quail Using the Vector pTnMod(Oval/Red) - Quail (SEQ ID NO:4) via Testicular
Injections

[0211] Approximately 450 quail eggs were set and hatched. At 21-28 days of age, the birds were identified, wing banded,
debeaked, and separated based on sex. At 4 weeks of age, 65 male birds underwent surgery and testicular injections
as described above. Injections comprised a control and 2 μg/testis of the vector in varying concentrations (0, 1/3, 1/5,
and 1/10) of three different transfection reagents: 1) SUPERFECT®, 2) Mirus/Panvera, and 3) Dosper. The study
comprised 13 treatment groups with 5 males per group. One transfection reagent was administered per day.
[0212] At 7 weeks of age, 2 age-matched females were housed with each treated male. The presence of the transfected
DNA was determined in the fertilized eggs during the second week of egg lay. Subsequent eggs from
parents producing positively identified transgenic eggs were collected and stored until taken to hatch. PCR performed
on the sperm of quail injected at four and five weeks of age indicated successful incorporation of the DsRed transgene
into the quail sperm.
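
One reading of the study design in this example is a single control group plus every combination of three transfection reagents and four concentration levels, which gives the stated 13 treatment groups. The sketch below enumerates the groups under that reading; the reagent labels and function name are illustrative, and the layout itself is an interpretation rather than something the text spells out.

```python
# Illustrative sketch: the factorial layout (1 control + 3 reagents x 4 levels)
# is an interpretation of the text; names and labels are hypothetical.
from itertools import product

REAGENTS = ["SUPERFECT", "Mirus/Panvera", "Dosper"]
LEVELS = ["0", "1/3", "1/5", "1/10"]
MALES_PER_GROUP = 5

def build_groups(reagents=REAGENTS, levels=LEVELS):
    """Return the control group followed by every reagent/level combination."""
    return [("control", None)] + list(product(reagents, levels))

if __name__ == "__main__":
    groups = build_groups()
    print(len(groups), "groups,", len(groups) * MALES_PER_GROUP, "males")
```

Under this reading, 13 groups of 5 males account for 65 birds.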

EXAMPLE 10 

Preparation of Transposon-Based Vector pTnMod(Oval/ENT TAG/p146/PA) - Chicken

[0213] A vector is designed for inserting a p146 gene under the control of a chicken ovalbumin promoter, and an
ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird, given below as SEQ ID NO:29.
[0214] Base pairs 1-130 are a remainder of F1(-) ori of pBluescriptII sk(-) (Stratagene) corresponding to base pairs
1-130 of pBluescriptII sk(-).

[0215] Base pairs 133 - 1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems)
corresponding to base pairs 229-1873 of pGWiz.

[0216] Base pairs 1780 - 2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
[0217] Base pairs 2988-2993 are an engineered stop codon.

[0218] Base pairs 2995 - 3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base
pairs 1922-2337 of pGWiz.

[0219] Base pairs 3415 - 3718 are non-coding DNA that is residual from vector pNK2859.
[0220] Base pairs 3719 - 3761 are λ DNA that is residual from pNK2859.

[0221] Base pairs 3762 - 3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon
Tn10.

[0222] Base pairs 3838 - 4044 are a multiple cloning site from pBluescriptII sk(-) corresponding to base pairs 924-718
of pBluescriptII sk(-).

[0223] Base pairs 4050 - 4951 are a chicken ovalbumin promoter (including SDRE) that corresponds to base pairs
431-1332 of the chicken ovalbumin promoter in GenBank Accession Number J00895 M24999.

[0224] Base pairs 4958 - 6115 are a chicken ovalbumin signal sequence and ovalbumin gene that correspond to base
pairs 66-1223 of GenBank Accession Number V00383.1 (the STOP codon being omitted).

[0225] Base pairs 6122 - 6271 are a TAG sequence containing a gp41 hairpin loop from HIV I, an enterokinase
cleavage site and a spacer (synthetic).

[0226] Base pairs 6272 - 6316 are a p146 sequence (synthetic) with 2 added stop codons.
[0227] Base pairs 6324 - 6676 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems)
corresponding to base pairs 1920 - 2272 of pGWiz.

[0228] Base pairs 6682 - 7114 are a multiple cloning site from pBluescriptII sk(-) corresponding to base pairs 667-235
of pBluescriptII sk(-).

[0229] Base pairs 7120 - 7189 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon
Tn10.

[0230] Base pairs 7190-7231 are λ DNA that is residual from pNK2859.

[0231] Base pairs 7232 - 8096 are non-coding DNA that is residual from pNK2859.

[0232] Base pairs 8097 - 10297 are pBlueScript sk(-) base vector (Stratagene, Inc.) corresponding to base pairs
761-2961 of pBluescriptII sk(-).

[0233] It should be noted that all non-coding DNA sequences described above can be replaced with any other
non-coding DNA sequence(s). Missing nucleotide sequences in the above construct represent restriction site remnants.
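
The statement that unlisted nucleotides are restriction site remnants can be illustrated by computing the gaps between consecutive listed features. The sketch below does so for the first ten features of SEQ ID NO:29 as listed above. The function name and feature labels are hypothetical; only the coordinates are taken from the text.

```python
# Illustrative sketch: the helper name and labels are hypothetical; coordinates
# are the first ten features listed for SEQ ID NO:29.

FEATURES = [
    (1, 130, "F1(-) ori remainder"),
    (133, 1777, "CMV promoter/enhancer"),
    (1780, 2987, "transposase"),
    (2988, 2993, "engineered stop codon"),
    (2995, 3410, "synthetic polyA"),
    (3415, 3718, "non-coding DNA from pNK2859"),
    (3719, 3761, "lambda DNA from pNK2859"),
    (3762, 3831, "left insertion sequence (IS10)"),
    (3838, 4044, "multiple cloning site"),
    (4050, 4951, "chicken ovalbumin promoter"),
]

def find_gaps(features):
    """Return inclusive (start, end) spans not covered between adjacent features."""
    ordered = sorted(features)
    gaps = []
    for (_, prev_end, _), (next_start, _, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps

if __name__ == "__main__":
    for start, end in find_gaps(FEATURES):
        print(f"bp {start}-{end}: {end - start + 1} bp not listed")
```

Each span returned is a candidate restriction site remnant in the sense of the paragraph above.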

EXAMPLE 11 

Preparation of Transposon-Based Vector pTnMod(Oval/ENT TAG/p146/PA) - Quail

[0234] A vector is designed for inserting a p146 gene under the control of a quail ovalbumin promoter, and an ovalbumin
gene including an ovalbumin signal sequence, into the genome of a bird, given below as SEQ ID NO:30.
[0235] Base pairs 1-130 are a remainder of F1(-) ori of pBluescriptII sk(-) (Stratagene) corresponding to base pairs
1-130 of pBluescriptII sk(-).

[0236] Base pairs 133 - 1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems)
corresponding to base pairs 229-1873 of pGWiz.

[0237] Base pairs 1780 - 2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
[0238] Base pairs 2988-2993 are an engineered stop codon.

[0239] Base pairs 2995 - 3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base
pairs 1922-2337 of pGWiz.

[0240] Base pairs 3415 - 3718 are non-coding DNA that is residual from vector pNK2859.
[0241] Base pairs 3719 - 3761 are λ DNA that is residual from pNK2859.

[0242] Base pairs 3762 - 3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon
Tn10.

[0243] Base pairs 3838 - 4044 are a multiple cloning site from pBluescriptII sk(-) corresponding to base pairs 924-718
of pBluescriptII sk(-).

[0244] Base pairs 4050 - 4938 are the Japanese quail ovalbumin promoter (including SDRE, steroid-dependent
response element). The Japanese quail ovalbumin promoter was isolated by its high degree of homology to the chicken
ovalbumin promoter (GenBank accession number J00895 M24999, base pairs 431-1332).

[0245] Base pairs 4945 - 6092 are a quail ovalbumin signal sequence and ovalbumin gene that corresponds to base
pairs 54 - 1201 of GenBank accession number X53964.1 (the STOP codon being omitted).

[0246] Base pairs 6097 - 6246 are a TAG sequence containing a gp41 hairpin loop from HIV I, an enterokinase
cleavage site and a spacer (synthetic).

[0247] Base pairs 6247 - 6291 are a p146 sequence (synthetic) with 2 added stop codons.

[0248] Base pairs 6299 - 6651 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems)
corresponding to base pairs 1920 - 2272 of pGWiz.

[0249] Base pairs 6657 - 7089 are a multiple cloning site from pBluescriptII sk(-) corresponding to base pairs 667-235
of pBluescriptII sk(-).

[0250] Base pairs 7095 - 7164 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon
Tn10.

[0251] Base pairs 7165 - 7206 are λ DNA that is residual from pNK2859.

[0252] Base pairs 7207 - 8071 are non-coding DNA that is residual from pNK2859.

[0253] Base pairs 8072 - 10272 are pBlueScript sk(-) base vector (Stratagene, Inc.) corresponding to base pairs
761-2961 of pBluescriptII sk(-).

[0254] It should be noted that all non-coding DNA sequences described above can be replaced with any other
non-coding DNA sequence(s). Missing nucleotide sequences in the above construct represent restriction site remnants.

EXAMPLE 12

Preparation of Transposon-Based Vector pTnMod(Oval/ENT TAG/ProIns/PA) - Chicken

[0255] A vector is designed for inserting a proinsulin gene under the control of a chicken ovalbumin promoter, and an
ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird, given below as SEQ ID NO:31.
[0256] Base pairs 1 - 130 are a remainder of F1(-) ori of pBluescriptII sk(-) (Stratagene) corresponding to base pairs
1-130 of pBluescriptII sk(-).

[0257] Base pairs 133 - 1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems) 
corresponding to base pairs 229-1873 of pGWiz. 

[0258] Base pairs 1780 - 2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
[0259] Base pairs 2988-2993 are an engineered stop codon.

[0260] Base pairs 2995 - 3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base
pairs 1922-2337 of pGWiz.

[0261] Base pairs 3415 - 3718 are non coding DNA that is residual from vector pNK2859. 
[0262] Base pairs 3719 - 3761 are λ DNA that is residual from pNK2859.

[0263] Base pairs 3762 - 3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon
Tn10.

[0264] Base pairs 3838 - 4044 are a multiple cloning site from pBluescriptII sk(-) corresponding to base pairs 924-718
of pBluescriptII sk(-).

[0265] Base pairs 4050 - 4951 are a chicken ovalbumin promoter (including SDRE) that corresponds to base pairs
431-1332 of the chicken ovalbumin promoter in GenBank Accession Number J00895 M24999.

[0266] Base pairs 4958 - 6115 are a chicken ovalbumin signal sequence and ovalbumin gene that correspond to base
pairs 66-1223 of GenBank Accession Number V00383.1 (the STOP codon being omitted).

[0267] Base pairs 6122 - 6271 are a TAG sequence containing a gp41 hairpin loop from HIV I, an enterokinase
cleavage site and a spacer (synthetic).

[0268] Base pairs 6272 - 6531 are a proinsulin gene.

[0269] Base pairs 6539 - 6891 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems)
corresponding to base pairs 1920 - 2272 of pGWiz.

[0270] Base pairs 6897 - 7329 are a multiple cloning site from pBluescriptII sk(-) corresponding to base pairs 667-235
of pBluescriptII sk(-).

[0271] Base pairs 7335 - 7404 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon
Tn10.

[0272] Base pairs 7405 - 7446 are λ DNA that is residual from pNK2859.

[0273] Base pairs 7447 - 8311 are non-coding DNA that is residual from pNK2859.

[0274] Base pairs 8312 - 10512 are pBlueScript sk(-) base vector (Stratagene, Inc.) corresponding to base pairs
761-2961 of pBluescriptII sk(-).

[0275] It should be noted that all non-coding DNA sequences described above can be replaced with any other non- 
coding DNA sequence(s). Missing nucleotide sequences in the above construct represent restriction site remnants. 

EXAMPLE 13 

Preparation of Transposon-Based Vector pTnMod(Oval/ENT TAG/ProIns/PA) - Quail

[0276] A vector is designed for inserting a proinsulin gene under the control of a quail ovalbumin promoter, and an
ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird, given below as SEQ ID NO:32.
[0277] Base pairs 1-130 are a remainder of F1(-) ori of pBluescriptII sk(-) (Stratagene) corresponding to base pairs
1-130 of pBluescriptII sk(-).

[0278] Base pairs 133 - 1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems)
corresponding to base pairs 229-1873 of pGWiz.

[0279] Base pairs 1780 - 2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
[0280] Base pairs 2988-2993 are an engineered stop codon.

[0281] Base pairs 2995 - 3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base
pairs 1922-2337 of pGWiz.

[0282] Base pairs 3415 - 3718 are non-coding DNA that is residual from vector pNK2859.
[0283] Base pairs 3719 - 3761 are λ DNA that is residual from pNK2859.

[0284] Base pairs 3762 - 3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon
Tn10.

[0285] Base pairs 3838 - 4044 are a multiple cloning site from pBluescriptII sk(-) corresponding to base pairs 924-718
of pBluescriptII sk(-).

[0286] Base pairs 4050 - 4938 are the Japanese quail ovalbumin promoter (including SDRE, steroid-dependent response element). The Japanese quail ovalbumin promoter was isolated by its high degree of homology to the chicken ovalbumin promoter (GenBank accession number J00895 M24999, base pairs 431-1332). Some deletions were noted in the quail sequence, as compared to the chicken sequence.

[0287] Base pairs 4945 - 6092 are a quail ovalbumin signal sequence and ovalbumin gene that corresponds to base pairs 54 - 1201 of GenBank accession number X53964.1 (the stop codon being omitted).

[0288] Base pairs 6093 - 6246 are a TAG sequence containing a gp41 hairpin loop from HIV I, an enterokinase cleavage site and a spacer (synthetic).

[0289] Base pairs 6247 - 6507 are a proinsulin gene.

[0290] Base pairs 6514 - 6866 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems) corresponding to base pairs 1920 - 2272 of pGWiz.

[0291] Base pairs 6867 - 7303 are a multiple cloning site from pBluescriptII sk(-) corresponding to base pairs 867-235 of pBluescriptII sk(-).

[0292] Base pairs 7304 - 7379 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon Tn10.

[0293] Base pairs 7380 - 7421 are lambda DNA that is residual from pNK2859.

[0294] Base pairs 7422 - 8286 are non-coding DNA that is residual from pNK2859.

[0295] Base pairs 8287 - 10487 are pBlueScript sk(-) base vector (Stratagene, Inc.) corresponding to base pairs 761-2961 of pBluescriptII sk(-).

[0296] It should be noted that all non-coding DNA sequences described above can be replaced with any other non-coding DNA sequence(s). Missing nucleotide sequences in the above construct represent restriction site remnants.
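The unannotated positions between listed elements, which the description attributes to restriction site remnants, can be located mechanically from a feature map. The following is a minimal Python sketch offered only as an illustration; the helper name `find_gaps` and the tuple layout are hypothetical, and the coordinates are the first five elements of the Example 13 construct as listed above (1-based, inclusive).

```python
# Hypothetical helper: list uncovered base-pair ranges between annotated features.
# Coordinates are 1-based and inclusive, as in the construct description.

def find_gaps(features):
    """Return (start, end) ranges lying between consecutive features."""
    ordered = sorted(features)  # tuples sort by start coordinate first
    gaps = []
    for (_, prev_end, _), (next_start, _, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps


# First five elements of the Example 13 construct: (start, end, label).
FEATURES = [
    (1, 130, "F1(-) ori remainder"),
    (133, 1777, "CMV promoter/enhancer"),
    (1780, 2987, "transposase"),
    (2988, 2993, "engineered stop codon"),
    (2995, 3410, "synthetic polyA"),
]

GAPS = find_gaps(FEATURES)
for start, end in GAPS:
    print(f"unannotated: {start}-{end} ({end - start + 1} bp)")
```

For these five elements the sketch reports positions 131-132, 1778-1779 and 2994 as unannotated. Such short gaps are consistent with the restriction site remnants mentioned above.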

EXAMPLE 14

Transfection of Immature Leghorn Roosters using a Transposon-based Vector containing a Proinsulin Gene via Testicular Injections

[0297] Vectors containing the elements Oval promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A (SEQ ID NO:31) and CMV promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A (SEQ ID NO:42) were each injected into the testes of 11 week old white Leghorn roosters. These birds were held under normal conditions until sexual maturity was reached.

[0298] At the time of sexual maturity, each bird was handled and manipulated to obtain sperm. Sperm samples were collected in Hank's Buffered Salt Solution (HBSS) and stored at either -20° C or 4° C until needed. DNA was extracted from sperm using a MoBio Ultra Clean DNA Bloodspin Kit (Mo Bio Laboratories, Solana Beach CA). Fifty microliters of sperm was used in the DNA extraction protocol and the purified genomic DNA eluted in 100 µl of water. In each PCR reaction, approximately 0.5 - 0.75 µg of genomic DNA was used with primers anchored in the entag-1 (5') and the synthetic polyA-2 (3'), which amplify a 685 bp fragment. Five of nine birds gave positive reactions for the presence of the appropriate vector construct. These birds were then mated with normal females.

[0299] Birds that did not yield positive results with PCR on the sperm were sacrificed, their testes removed, and DNA extracted using an approximately 25 mg piece of tissue in a Qiagen DNEasy Tissue Kit; purified DNA was eluted in 200 µl water and PCR conducted as described above. Two of these birds gave a very strong, positive PCR reaction.

EXAMPLE 15

Transfection of Japanese Quail using a Transposon-based Vector containing a Proinsulin Gene via Oviduct Injections

[0300] Two experiments were conducted in Japanese quail using transposon-based vectors containing either Oval promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A (SEQ ID NO:31) or CMV promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A (SEQ ID NO:42).

[0301] In the first experiment, the Oval promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A containing construct was injected into the oviduct of sexually mature quail; three hens received 5 µg at a 1:3 Superfect ratio and three received 10 µg at a 1:3 Superfect ratio. As of the writing of the present application, at least one bird that received 10 µg of DNA was producing human proinsulin in egg white (other birds remain to be tested). This experiment indicates that 1) the DNA has been stable for at least 3 months; 2) protein levels are comparable to those observed with a constitutive promoter such as the CMV promoter; and 3) sexually mature birds can be injected and results obtained without the need for cell culture.

[0302] In the second experiment, the transposon-based vector containing CMV promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A was injected into the oviduct of sexually immature Japanese quail. A total of 9 birds were injected. Of the 8 survivors, 3 produced human proinsulin in the white of their eggs for over 6 weeks. An ELISA assay described in detail below was developed to detect GP41 in the fusion peptide (Oval gene/GP41 Enterokinase TAG/Proinsulin) since the GP41 peptide sequence is unique and not found as part of normal egg white protein. In all ELISA assays, the same birds produced positive results and all controls worked as expected.

[0303] ELISA Procedure: Individual egg white samples were diluted in sodium carbonate buffer, pH 9.8, and added to individual wells of 96 well microtiter ELISA plates at a total volume of 0.1 mL. These plates were then allowed to coat overnight at 4°C. Prior to ELISA development, the plates were allowed to warm to room temperature. Upon decanting the coating solutions and blotting away any excess, non-specific binding of antibodies was blocked by adding a solution of phosphate buffered saline (PBS), 1% (w/v) BSA, and 0.05% (v/v) Tween 20 and allowing it to incubate with shaking for a minimum of 45 minutes. This blocking solution was subsequently decanted and replaced with a solution of the primary antibody (Goat Anti-GP41 TAG) diluted in fresh PBS/BSA/Tween 20. After a two hour period of incubation with the primary antibody, each plate was washed with a solution of PBS and 0.05% Tween 20 in an automated plate washer to remove unbound antibody. Next, the secondary antibody, Rabbit anti-Goat Alkaline Phosphatase-conjugated, was diluted in PBS/BSA/Tween 20 and allowed to incubate 1 hour. The plates were then subjected to a second wash with PBS/Tween 20. Antigen was detected using a solution of p-Nitrophenyl Phosphate in Diethanolamine Substrate Buffer for Alkaline Phosphatase and measuring the absorbance at 30 minutes and 1 hour.

EXAMPLE 16

Optimization of Intra-oviduct and Intra-ovarian Arterial Injections

[0304] Overall transfection rates of oviduct cells in a flock of chicken or quail hens are enhanced by synchronizing the development of the oviduct and ovary within the flock. When the development of the oviducts and ovaries is uniform across a group of hens and when the stage of oviduct and ovarian development can be determined or predicted, timing of injections is optimized to transfect the greatest number of cells. Accordingly, oviduct development is synchronized as described below to ensure that a large and uniform proportion of oviduct secretory cells are transfected with the gene of interest.

[0305] Hens are treated with estradiol to stimulate oviduct maturation as described in Oka and Schimke (T. Oka and RT Schimke, J. Cell Biol., 41, 818 (1969)) and Palmiter, Christensen and Schimke (J. Biol. Chem. 245(4):833-845, 1970). Specifically, repeated daily injections of 1 mg estradiol benzoate are performed sometime before the onset of sexual maturation, a period ranging from 1-14 weeks of age. After a stimulation period sufficient to maximize development of the oviduct, hormone treatment is withdrawn, thereby causing regression in oviduct secretory cell size but not cell number. At an optimum time after hormone withdrawal, the oviducts of treated hens are injected with the transposon-based vector. Hens are subjected to additional estrogen stimulation after an optimized time during which the transposon-based vector is taken up into oviduct secretory cells. Re-stimulation by estrogen activates the transposon mechanism of the transposon-based vector, causing the integration of the gene of interest into the host genome. Estrogen stimulation is then withdrawn and hens continue normal sexual development. If a developmentally regulated promoter such as the ovalbumin promoter is used, expression of the transposon-based vector initiates in the oviduct at the time of sexual maturation. Intra-ovarian artery injection during this window allows for high and uniform transfection efficiencies of ovarian follicles to produce germ-line transfections and possibly oviduct expression.

[0306] Other means are also used to synchronize the development, or regression, of the oviduct and ovary to allow high and uniform transfection efficiencies. Alterations of lighting and/or feed regimens, for example, cause hens to molt, during which time the oviduct and ovary regress. Molting is used to synchronize hens for transfection, and may be used in conjunction with other hormonal methods to control regression and/or development of the oviduct and ovary.

EXAMPLE 17 

Isolation of Human Proinsulin Using Anti-TAG Column Chromatography

[0307] A HiTrap NHS-activated 1 mL column (Amersham) was charged with a 30 amino acid peptide that contained the gp-41 epitope containing gp-41's native disulfide bond that stabilizes the formation of the gp-41 hairpin loop. The 30 amino acid gp41 peptide is provided as SEQ ID NO:23. Approximately 10 mg of the peptide was dissolved in coupling buffer (0.2 M NaHCO3, 0.5 M NaCl, pH 8.3) and the ligand was circulated on the column for 2 hours at room temperature at 0.5 ml/minute. Excess active groups were then deactivated using 6 column volumes of 0.5 M ethanolamine, 0.5 M NaCl, pH 8.3 and the column was washed alternately with 6 column volumes of acetate buffer (0.1 M acetate, 0.5 M NaCl, pH 4.0) and ethanolamine (above). The column was neutralized using 1 X PBS. The column was then washed with buffers to be used in affinity purification: 75 mM Tris, pH 8.0 and elution buffer, 100 mM glycine-HCl, 0.5 M NaCl, pH 2.7. Finally, the column was equilibrated in 75 mM Tris buffer, pH 8.0.

[0308] Antibodies to gp-41 were raised in goats by inoculation with the gp-41 peptide described above. More specifically, goats were inoculated, given a booster injection of the gp-41 peptide and then bled. Serum was harvested by centrifugation. Approximately 30 ml of goat serum was filtered to 0.45 µm and passed over a TAG column at a rate of 0.5 ml/min. The column was washed with 75 mM Tris, pH 8.0 until absorbance at 280 nm reached a baseline. Three column volumes (3 mL) of elution buffer (100 mM glycine, 0.5 M NaCl, pH 2.7) were applied, followed by 75 mM Tris buffer, pH 8.0, all at a rate of 0.5 mL/min. One milliliter fractions were collected. Fractions were collected into 200 µL 1 M Tris, pH 9.0 to neutralize acidic fractions as rapidly as possible. A large peak eluted from the column, coincident with the application of the elution buffer. Fractions were pooled. Analysis by SDS-PAGE showed a high molecular weight species that separated into two fragments under reducing condition, in keeping with the heavy and light chain structure of IgG.

[0309] Pooled antibody fractions were used to charge two 1 mL HiTrap NHS-activated columns, attached in series. Coupling was carried out in the same manner as that used for charging the TAG column.

[0310] Egg white from quail and chickens treated by intra-oviduct injection of the CMV-ovalbumin-TAG-proinsulin construct were pooled. Viscosity was lowered by subjecting the allantoid fluid to successively finer pore sizes using negative pressure filtration, finishing with a 0.22 µm pore size. Through the process, egg white was diluted approximately 1:18. The clarified sample was loaded on the Anti-TAG column and eluted in the same manner as described for the purification of the anti-TAG antibodies. A peak of absorbance at 280 nm, coincident with the application of the elution buffer, indicated that protein had been specifically eluted from the Anti-TAG column. Fractions containing the eluted peak were pooled for analysis.

[0311] The pooled fractions from the Anti-TAG affinity column were characterized by SDS-PAGE and western blot analysis. SDS-PAGE of the pooled fractions revealed a 60 kDal molecular weight band not present in control egg white fluid, consistent with the predicted molecular weight of the transgenic protein. Although some contaminating bands were observed, the 60 kDal species was greatly enriched compared to the other proteins. An aliquot of the pooled fractions was cleaved overnight at room temperature with the protease enterokinase. SDS-PAGE analysis of the cleavage product revealed a band not present in the uncut material that co-migrated with a commercial human proinsulin positive control. Western blot analysis showed specific binding to the 60 kDal species under non-reducing conditions (which preserve the hairpin epitope of gp-41 by retaining the disulfide bond). Western analysis of the low molecular weight species that appeared upon cleavage, using an anti-human proinsulin antibody, conclusively identified the cleaved fragment as human proinsulin.



EXAMPLE 18 

Construction of a Transposon-based Transgene for the Expression of a Monoclonal Antibody

[0312] Production of a monoclonal antibody using transposon-based transgenic methodology is accomplished in a variety of ways.

1) Two vectors are constructed: one that encodes the light chain and a second vector that encodes the heavy chain of the monoclonal antibody. These vectors are then incorporated into the genome of the target animal by at least one of two methods: a) direct transfection of a single animal with both vectors (simultaneously or as separate events); or b) a male and a female of the species each carry one of the vectors in their germline and are then mated to produce progeny that inherit a copy of each.

2) The light and heavy chains are included on a single DNA construct, either separated by insulators with expression governed by the same (or different) promoters, or by using a single promoter governing expression of both transgenes with the inclusion of elements that permit separate translation of both transgenes, such as an internal ribosome entry site.

[0313] The following example describes the production of a transposon-based DNA construct that contains both the coding region for a monoclonal light chain and a heavy chain on a single construct. Beginning with the vector pTnMod, the coding sequences for the heavy and light chains are added, each preceded by an appropriate promoter and signal sequence. Using methods known to one skilled in the art, approximately 1 Kb of the proximal elements of the ovalbumin promoter are linked to the signal sequence of ovalbumin or some other protein secreted from the target tissue. Two copies of the promoter and signal sequence are added to the multiple cloning site of pTnMod, leaving space and key restriction sites between them to allow the subsequent addition of the coding sequences of the light and heavy chains of the monoclonal antibody. Methods known to one skilled in the art allow the coding sequences of the light and heavy chains to be inserted in-frame for appropriate expression. For example, the coding sequences of light and heavy chains of a murine monoclonal antibody that shows specificity for human seminoprotein have recently been disclosed (GenBank Accession numbers AY129008 and AY129304 for the light and heavy chains, respectively). The light chain cDNA sequence is provided in SEQ ID NO:34, whereas the cDNA of the heavy chain is provided in SEQ ID NO:35.

[0314] Thus one skilled in the art can produce both the heavy and light chains of a monoclonal antibody in a single cell within a target tissue and species. If the modified cell contained normal posttranslational modification capabilities, the two chains would form their native configuration and disulfide attachments and be substrates for glycosylation. Upon secretion, then, the monoclonal antibody is accumulated, for example, in the egg white of a chicken egg, if the transgenes are expressed in the magnum of the oviduct.

[0315] It should also be noted that, although this example details production of a full-length murine monoclonal antibody, the method is quite capable of producing hybrid antibodies (e.g. a combination of human and murine sequences; 'humanized' monoclonal antibodies), as well as useful antibody fragments known to one skilled in the art, such as Fab, Fc, F(ab) and Fv fragments. This method can be used to produce molecules containing the specific areas thought to be the antigen recognition sequences of antibodies (complementarity determining regions), linked, modified or incorporated into other proteins as desired.

EXAMPLE 19 

Treatment of rats with a transposon-based vector for tissue-specific insulin gene incorporation

[0316] Rats are made diabetic by administering the drug streptozotocin (Zanosar; Upjohn, Kalamazoo, MI) at approximately 200 mg/kg. The rats are bred and maintained according to standard procedures. A transposon-based vector containing a proinsulin gene, an appropriate carrier, and, optionally, a transfection agent, are injected into rats' singhepatic (if using G6P) artery with the purpose of stable transformation. Incorporation of the insulin gene into the rat genome and levels of insulin expression are ascertained by a variety of methods known in the art. Blood and tissue samples from live or sacrificed animals are tested. A combination of PCR, Southern and Northern blots, in-situ hybridization and related nucleic acid analysis methods are used to determine incorporation of the vector-derived proinsulin DNA and levels of transcription of the corresponding mRNA in various organs and tissues of the rats. A combination of SDS-PAGE gels, Western Blot analysis, radioimmunoassay, ELISA and other methods known to one of ordinary skill in the art are used to determine the presence of insulin and the amount produced. Additional transfections of the vector are used to increase protein expression if the initial amounts of the expressed insulin are not satisfactory, or if the level of expression tapers off. The physiological condition of the rats is closely examined post-transfection to register positive or any negative effects of the gene therapy. Animals are examined over extended periods of time post-transfection in order to monitor the stability of gene incorporation and protein expression.

EXAMPLE 20 

Exemplary Transposon-Based Vectors

[0317] The following example provides a description of various transposon-based vectors of the present invention and several constructs for insertion into the transposon-based vectors of the present invention. These examples are not meant to be limiting in any way. The constructs for insertion into a transposon-based vector are provided in a cloning vector labeled pTnMCS.

pTnMCS (base vector)

[0318]

Bp 1 - 130 Remainder of F1(-) ori of pBluescriptII sk(-) (Stratagene) bp 1-130
Bp 133 - 1777 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems) bp 229-1873
Bp 1783 - 2991 Transposase, from Tn10 (GenBank accession # J01829) bp 108-1316
Bp 2992 - 3344 Non-coding DNA from vector pNK2859
Bp 3345 - 3387 Lambda DNA from pNK2859
Bp 3388 - 3457 70 bp of IS10 left from Tn10
Bp 3464 - 3670 Multiple cloning site from pBluescriptII sk(-), thru the XmaI site bp 924-718
Bp 3671 - 3715 Multiple cloning site from pBluescriptII sk(-), from the XmaI site thru the XhoI site. These base pairs are usually lost when cloning into pTnMCS. bp 717-673
Bp 3716 - 4153 Multiple cloning site from pBluescriptII sk(-), from the XhoI site bp 672-235
Bp 4159 - 4228 70 bp of IS10 left from Tn10
Bp 4229 - 4270 Lambda DNA from pNK2859
Bp 4271 - 5114 Non-coding DNA from pNK2859
Bp 5115 - 7315 pBluescript sk(-) base vector (Stratagene, Inc.) bp 761-2961
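
Each row of the table above pairs a span in the vector with a span in its source sequence, so the two spans should have equal lengths. The following is a minimal Python sketch of that consistency check, given only as an illustration; the names `span_length` and `mismatched_rows` are hypothetical, and the rows are a subset of the pTnMCS base vector table. Source ranges written high-to-low (for example 672-235) are treated as reverse-orientation spans of the same length.

```python
# Hypothetical consistency check for a base-pair feature table.
# All coordinates are 1-based and inclusive.

def span_length(start, end):
    """Length of an inclusive span; works for reverse-orientation ranges."""
    return abs(end - start) + 1


def mismatched_rows(rows):
    """Return labels of rows whose vector span and source span differ in length."""
    return [
        label
        for label, (v_start, v_end), (s_start, s_end) in rows
        if span_length(v_start, v_end) != span_length(s_start, s_end)
    ]


# (label, span in vector, span in source), taken from the pTnMCS table.
ROWS = [
    ("F1(-) ori remainder", (1, 130), (1, 130)),
    ("CMV promoter/enhancer", (133, 1777), (229, 1873)),
    ("Tn10 transposase", (1783, 2991), (108, 1316)),
    ("MCS from the XhoI site", (3716, 4153), (672, 235)),
    ("pBluescript base vector", (5115, 7315), (761, 2961)),
]

BAD = mismatched_rows(ROWS)
print("rows with inconsistent lengths:", BAD)
```

A row flagged by such a check would point to a transcription error in one of its coordinates rather than to a property of the construct itself.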

pTnMCS (CMV-prepro-ent-hGH-CPA)

[0319]

Bp 1 - 3670 from vector pTnMCS, bp 1 - 3670
Bp 3676 - 5320 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5326 - 5496 Capsite/Prepro taken from GenBank accession # X07404, bp 563 - 733
Bp 5504 - 5652 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 5653 - 6306 Human growth hormone taken from GenBank accession # V00519, bp 1-654
Bp 6313 - 6720 Conalbumin polyA taken from GenBank accession # Y00407, bp 10651-11058
Bp 6722 - 10321 from cloning vector pTnMCS, bp 3716-7315

pTnMCS (CMV-CHOVg-ent-ProInsulin-synPA) (SEQ ID NO:41)

[0320]

Bp 1 - 3670 from vector pTnMCS, bp 1 - 3670
Bp 3676 - 5320 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5327 - 6480 Chicken ovalbumin gene taken from GenBank accession # V00383, bp 66-1219
Bp 6487 - 6636 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6637 - 6897 Human Proinsulin taken from GenBank accession # NM000207, bp 117-377
Bp 6898 - 6942 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
Bp 6943 - 7295 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
Bp 7296 - 10895 from cloning vector pTnMCS, bp 3716-7315

pTnMCS (CMV-prepro-ent-ProInsulin-synPA)

Bp 1 - 3670 from vector pTnMCS, bp 1 - 3670
Bp 3676 - 5320 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5326 - 5496 Capsite/Prepro taken from GenBank accession # X07404, bp 563 - 733
Bp 5504 - 5652 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 5653 - 5913 Human Proinsulin taken from GenBank accession # NM000207, bp 117-377
Bp 5914 - 5958 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
Bp 5959 - 6310 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
Bp 6313 - 9912 from cloning vector pTnMCS, bp 3716-7315

[0321]

Bp 1 - 3670 from vector pTnMCS, bp 1 - 3670
Bp 3676 - 4350 Chicken Ovalbumin enhancer taken from GenBank accession # S82527.1 bp 1-675
Bp 4357 - 5692 Chicken Ovalbumin promoter taken from GenBank accession # J00895-M24999 bp 1-1336
Bp 5699 - 6917 Chicken Ovalbumin gene from GenBank Accession # V00383.1 bp 2-1220. (This sequence includes the 5'UTR, containing putative cap site, bp 5699-5762.)
Bp 6924 - 7073 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 7074 - 7334 Human proinsulin GenBank Accession # NM000207 bp 117-377
Bp 7335 - 7379 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
Bp 7380 - 7731 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920 - 2271
Bp 7733 - 11332 from vector pTnMCS, bp 3716 - 7315

pTnMCS(Chicken OVep+prepro+ENT+proins+syn polyA)

[0322]

Bp 1 - 3670 from cloning vector pTnMCS, bp 1 - 3670
Bp 3676 - 4350 Chicken Ovalbumin enhancer taken from GenBank accession # S82527.1 bp 1-675
Bp 4357 - 5692 Chicken Ovalbumin promoter taken from GenBank accession # J00895-M24999 bp 1-1336
Bp 5699 - 5869 Cecropin cap site and Prepro, GenBank accession # X07404 bp 563-733
Bp 5876 - 6025 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6026 - 6286 Human proinsulin GenBank Accession # NM000207 bp 117-377
Bp 6287 - 6331 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
Bp 6332 - 6683 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
Bp 6685 - 10284 from cloning vector pTnMCS, bp 3716 - 7315

pTnMCS(Quail OVep+OVg+ENT+proins+syn polyA)

Bp 1 - 3670 from cloning vector pTnMCS, bp 1 - 3670
Bp 3676 - 4333 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-house from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession # S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases does not correspond exactly.)
Bp 4340 - 5705 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession # J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
Bp 5712 - 6910 Quail Ovalbumin gene, EMBL accession # X53964, bp 1-1199. (This sequence includes the 5'UTR, containing putative cap site bp 5712-5784.)
Bp 6917 - 7066 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 7067 - 7327 Human proinsulin GenBank Accession # NM000207 bp 117-377
Bp 7328 - 7372 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
Bp 7373 - 7724 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
Bp 7726 - 11325 from cloning vector pTnMCS, bp 3716 - 7315



Bp 1 - 3670 from vector pTnMCS, bp 1-3670
Bp 3676 - 4350 Chicken Ovalbumin enhancer taken from GenBank accession # S82527.1, bp 1-675
Bp 4357 - 5692 Chicken Ovalbumin promoter taken from GenBank accession # J00895-M24999, bp 1-1336
Bp 5699 - 5869 Capsite/Prepro taken from GenBank accession # X07404, bp 563-733
Bp 5877 - 6025 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6026 - 6679 Human growth hormone taken from GenBank accession # V00519, bp 1-654
Bp 6686 - 7093 Conalbumin polyA taken from GenBank accession # Y00407, bp 10651-11058
Bp 7095 - 10694 from cloning vector pTnMCS, bp 3716-7315

pTnMCS(Quail OVep+prepro+ENT+proins+syn polyA)

Bp 1 - 3670 from cloning vector pTnMCS, bp 1 - 3670
Bp 3676 - 4333 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-house from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession # S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases does not correspond exactly.)
Bp 4340 - 5705 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession # J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
Bp 5712 - 5882 Cecropin cap site and Prepro, GenBank accession # X07404 bp 563-733
Bp 5889 - 6038 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6039 - 6299 Human proinsulin GenBank Accession # NM000207 bp 117-377
Bp 6300 - 6344 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
Bp 6345 - 6696 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920 - 2271
Bp 6698 - 10297 from cloning vector pTnMCS, bp 3716 - 7315



Bp 1 - 130 Remainder of F1(-) ori of pBluescriptII sk(-) (Stratagene) bp 1-130
Bp 133 - 1777 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems) bp 229-1873
Bp 1783 - 2991 Transposase, modified from Tn10 (GenBank accession # J01829) bp 108-1316
Bp 2992 - 2994 Engineered stop codon
Bp 2996 - 3411 Synthetic polyA from gWIZ (Gene Therapy Systems) bp 1922 - 2337
Bp 3412 - 3719 Non-coding DNA from vector pNK2859
Bp 3720 - 3762 Lambda DNA from pNK2859
Bp 3763 - 3832 70 bp of IS10 left from Tn10
Bp 3839 - 4045 Multiple cloning site from pBluescriptII sk(-), thru the XmaI site bp 924-718
Bp 4046 - 4090 Multiple cloning site from pBluescriptII sk(-), from the XmaI site thru the XhoI site. These base pairs are usually lost when cloning into pTnMCS. bp 717-673
Bp 4091 - 4528 Multiple cloning site from pBluescriptII sk(-), from the XhoI site bp 672-235
Bp 4534 - 4603 70 bp of IS10 left from Tn10
Bp 4604 - 4645 Lambda DNA from pNK2859
Bp 4646 - 5489 Non-coding DNA from pNK2859
Bp 5490 - 7690 pBluescript sk(-) base vector (Stratagene, Inc.) bp 761-2961

pTnMOD (CHOVep-prepro-ent-hGH-CPA)

Bp 1 - 4045 from vector pTnMCS, bp 1 - 4045
Bp 4051 - 4725 Chicken Ovalbumin enhancer taken from GenBank accession # S82527.1, bp 1 - 675
Bp 4732 - 6067 Chicken Ovalbumin promoter taken from GenBank accession # J00895-M24999, bp 1-1336
Bp 6074 - 6245 Capsite/Prepro taken from GenBank accession # X07404, bp 563 - 733
Bp 6252 - 6400 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6401 - 7054 Human growth hormone taken from GenBank accession # V00519, bp 1-654
Bp 7061 - 7468 Conalbumin polyA taken from GenBank accession # Y00407, bp 10651-11058
Bp 7470 - 11069 from cloning vector pTnMCS, bp 3716-7315

pTnMOD (CMV-CHOVg-ent-ProInsulin-synPA) (SEQ ID NO:42)

Bp 1 - 4045 from vector pTnMCS, bp 1 - 4045
Bp 4051 - 5695 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5702 - 6855 Chicken ovalbumin gene taken from GenBank accession # V00383, bp 66-1219
Bp 6862 - 7011 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 7012 - 7272 Human Proinsulin taken from GenBank accession # NM000207, bp 117-377
Bp 7273 - 7317 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
Bp 7318 - 7670 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
Bp 7672 - 11271 from cloning vector pTnMCS, bp 3716-7315



Bp 1 - 4045 from vector pTnMCS, bp 1 - 4045
Bp 4051 - 5695 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5701 - 5871 Capsite/Prepro taken from GenBank accession # X07404, bp 563 - 733
Bp 5879 - 6027 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6028 - 6681 Human growth hormone taken from GenBank accession # V00519, bp 1-654
Bp 6688 - 7095 Conalbumin polyA taken from GenBank accession # Y00407, bp 10651-11058
Bp 7097 - 10696 from cloning vector pTnMCS, bp 3716-7315

pTnMOD (CMV-prepro-ent-ProInsulin-synPA)

Bp 1 - 4045 from vector pTnMCS, bp 1 - 4045
Bp 4051 - 5695 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
Bp 5701 - 5871 Capsite/Prepro taken from GenBank accession # X07404, bp 563 - 733
Bp 5879 - 6027 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6028 - 6288 Human Proinsulin taken from GenBank accession # NM000207, bp 117-377
Bp 6289 - 6333 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
Bp 6334 - 6685 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
Bp 6687 - 10286 from cloning vector pTnMCS, bp 3716-7315

pTnMOD(Chicken OVep+OVg'+ENT+proins+syn polyA) (SEQ ID NO: 43)



Bp 1 - 4045 from cloning vector pTnMOD, bp 1 - 4045

Bp 4051 - 4725 Chicken Ovalbumin enhancer taken from GenBank accession # S82527.1 bp 1-675

Bp 4732 - 6067 Chicken Ovalbumin promoter taken from GenBank accession # J00895-M24999 bp 1-1336

Bp 6074 - 7292 Chicken Ovalbumin gene from GenBank Accession # V00383.1 bp 2-1220. (This sequence includes
the 5'UTR, containing putative cap site bp 6074-6137.)

Bp 7299 - 7448 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 7449 - 7709 Human proinsulin GenBank Accession # NM000207 bp 117-377

Bp 7710 - 7754 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ
(Gene Therapy Systems)

Bp 7755 - 8106 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
Bp 8108 - 11707 from cloning vector pTnMCS, bp 3716 - 7315




Bp 1 - 4045 from cloning vector pTnMCS, bp 1 - 4045

Bp 4051 - 4725 Chicken Ovalbumin enhancer taken from GenBank accession # S82527.1 bp 1-675

Bp 4732 - 6067 Chicken Ovalbumin promoter taken from GenBank accession # J00895-M24999 bp 1-1336

Bp 6074 - 6244 Cecropin cap site and Prepro, Genbank accession # X07404 bp 563-733

Bp 6251 - 6400 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6401 - 6661 Human proinsulin GenBank Accession # NM000207 bp 117-377

Bp 6662 - 6706 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ
(Gene Therapy Systems)

Bp 6707 - 7058 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920 - 2271
Bp 7060 - 10659 from cloning vector pTnMCS, bp 3716 - 7315

pTnMOD(Quail OVep+OVg'+ENT+proins+syn polyA)
[0333] 

Bp 1 - 4045 from cloning vector pTnMCS, bp 1 - 4045

Bp 4051 - 4708 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-house from quail genomic DNA, roughly
equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession # S82527.1, bp 1-675. (There are
multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases
does not correspond exactly.)

Bp 4715 - 6080 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house from quail genomic DNA, roughly
corresponding to chicken ovalbumin promoter, GenBank accession # J00895-M24999 bp 1-1336. (There are multiple
base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not
correspond exactly.)

Bp 6087 - 7285 Quail Ovalbumin gene, EMBL accession # X53964, bp 1-1199. (This sequence includes the 5'UTR,
containing putative cap site bp 6087-6139.)

Bp 7292 - 7441 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 7442 - 7702 Human proinsulin GenBank Accession # NM000207 bp 117-377

Bp 7703 - 7747 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ
(Gene Therapy Systems)

Bp 7748 - 8099 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920 - 2271
Bp 8101 - 11700 from cloning vector pTnMCS, bp 3716 - 7315
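
The vector maps in this section all follow the same pattern: each line gives a base-pair range in the construct, followed by the source of that segment. A minimal sketch of how such a map could be checked for reversed or overlapping ranges is given below. The function names, the regular expression and the shortened sample descriptions are illustrative assumptions and are not part of the specification; the sample coordinates are transcribed from the quail ovalbumin map above.

```python
import re

# Hypothetical pattern for map lines of the form "Bp <start> - <end> <description>".
FEATURE_RE = re.compile(r"^Bp\s+(\d+)\s*-\s*(\d+)\s+(.*)$", re.IGNORECASE)


def parse_features(lines):
    """Return (start, end, description) tuples for lines that look like map entries."""
    features = []
    for line in lines:
        match = FEATURE_RE.match(line.strip())
        if match:
            features.append((int(match.group(1)), int(match.group(2)), match.group(3)))
    return features


def find_problems(features):
    """Report reversed ranges and ranges that start inside an earlier feature."""
    problems = []
    previous_end = 0
    for start, end, description in features:
        if end < start:
            problems.append(f"reversed range {start}-{end}: {description}")
        if start <= previous_end:
            problems.append(
                f"overlap at {start} (previous feature ends at {previous_end}): {description}"
            )
        previous_end = max(previous_end, end)
    return problems


# Sample entries (descriptions shortened for illustration).
example = [
    "Bp 1 - 4045 from cloning vector pTnMCS, bp 1 - 4045",
    "Bp 4051 - 4708 Quail Ovalbumin enhancer",
    "Bp 4715 - 6080 Quail Ovalbumin promoter",
    "Bp 6087 - 7285 Quail Ovalbumin gene",
]
features = parse_features(example)
problems = find_problems(features)
```

A check of this kind only flags internal inconsistencies in the listed coordinates; it says nothing about whether the underlying sequence matches the cited accession.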

pTnMOD(Quail OVep+prepro+ENT+proins+syn polyA)

[0334] 

Bp 1 - 4045 from cloning vector pTnMCS, bp 1 - 4045

Bp 4051 - 4708 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-house from quail genomic DNA, roughly
equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession # S82527.1, bp 1-675. (There are
multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases
does not correspond exactly.)

Bp 4715 - 6080 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house from quail genomic DNA, roughly
corresponding to chicken ovalbumin promoter, GenBank accession # J00895-M24999 bp 1-1336. (There are multiple
base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not
correspond exactly.)

Bp 6087 - 6257 Cecropin cap site and Prepro, Genbank accession # X07404 bp 563-733

Bp 6264 - 6413 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
Bp 6414 - 6674 Human proinsulin GenBank Accession # NM000207 bp 117-377

Bp 6675 - 6719 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ
(Gene Therapy Systems)

Bp 6720 - 7071 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920 - 2271
Bp 7073 - 10672 from cloning vector pTnMCS, bp 3716 - 7315

[0335]

BP 1-130 remainder of F1(-) ori of pBluescriptII sk(-) (Stratagene) bp 1-130.

BP 133-1777 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems) bp 229-1873.

BP 1780-2987 Transposase, modified from Tn10 (GenBank #J01829).

BP 2988-2990 Engineered stop codon.

BP 2991-3343 non coding DNA from vector pNK2859.

BP 3344-3386 Lambda DNA from pNK2859.

BP 3387-3456 70 bp of IS10 left from Tn10.

BP 3457-3674 multiple cloning site from pBluescriptII sk(-) bp 924-707.

BP 3675-5691 Chicken Ovalbumin enhancer plus promoter from a Topo Clone 10 maxi 040303 (5' XmaI, 3' BamHI)

BP 5698-5865 prepro with Cap site amplified from cecropin of pMON200
GenBank # X07404 (5' BamHI, 3' KpnI)

BP 5872-7338 Protein A gene from GenBank # J01786, mature peptide bp 292-1755 (5' KpnI, 3' SacII)

BP 7345-7752 ConPolyA from Chicken conalbumin polyA from GenBank # Y00407 bp 10651-11058. (5' SacII, 3' XhoI)

BP 7753-8195 multiple cloning site from pBluescriptII sk(-) bp 677-235.

BP 8196-8265 70 bp of IS10 left from Tn10.

BP 8266-8307 Lambda DNA from pNK2859

BP 8308-9151 noncoding DNA from pNK2859

BP 9152-11362 pBluescriptII sk(-) base vector (Stratagene, INC.) bp 761-2961

[0336] All patents, publications and abstracts cited above are incorporated herein by reference in their entirety. It
should be understood that the foregoing relates only to preferred embodiments of the present invention and that numerous
modifications or alterations may be made therein without departing from the spirit and the scope of the present invention
as defined in the following claims.

Appendix A 

[0337]
SEQ ID NO:1 (pTnMod)

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 50
GGCAGCGTGA CCGCTACACT TGGGAGCGCC CTAGCGCCCG CTCCTTTCGC 100
TTTCTTCCCT TCCTTTCTCG CCACGTTGGC CGGCATCAGA TTGGCTATTG 150
GCCATTGGAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200
CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAG TAGTTATTAA 250
TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCGG 300
GGTTACATAA CTTAGGGTAA ATGGCCCGCC TGGCTGACCG CCCAAGGACC 350
CCCGCCCATT GACGTCAATA ATGACGTATG TTGCCATAGT AAGGCCAATA 400
GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 450
CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG 500
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 550
TGGGACTTTC CTACTTGGCA GTAGATCTAC GTATTAGTCA TCGGTATTAC 600
CATGGTGATG CGGTTTTGGG AGTACATGAA TGGGCGTGGA TAGCGGTTTG 650
ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 700
TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 750
CCGATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800
GCAGAGCTCG TTTAGTGAAG GGTGAGATCG CCTGGAGAGG CCATCCACGC 850
TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TGGGCGGCCG 900
GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GACGTAAGTA 950
CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000
CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTTCCTTAT GGTATAGGTG 1050
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCAGT 1100
CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC ATGGCTCTTT 1150
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 1200
TGACACGGAC TCTGTATTTT TACAGGATGG GGTGCCATTT ATTATTTAGA 1250
AATTCACATA TACAACAACG CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300
CATAGCGTGG GATCTCCACG CGAATCTCGG GTACGTGTTC CGGACATGGG 1350
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCCT GGTCCCATGC 1400
CTCCAGCGGC TCATGGTGGG TGGGCAGCTC CTTGCTCCTA AGAGTGGAGG 1450
CCAGACTTAG GGACAGCACA ATGCCCACCA CCACGAGTGT GCCGCACAAG 1500
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550
CACGGGTGAC GCAGATGGAA GACTTAAGGC AGCGGCAGAA GAAGATGCAG 1600
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650
GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC 1700
CGCGCGCGGC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750
GCATGGGTCT TTTGTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 1800
TTACATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT TAAAACGACT 1850
CAACAGCTTA ACGTTGGCTT GGCAGGCATT ACTTGACTGT AAAACTCTCA 1900
CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGCGAG AACAAAACAT 1950
AACATCAAAC GAATCGAGCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2000
GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG CAAAAACGAC 2100
TTATGGTATT GCGAGCTTCA GTCGCACTAG AGGGTCGTTC TGTTACTCTT 2150
TATGAGAAAG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 2200
CCAATTTCTA GCCGACCTTG CGAGGATTCT ACCGAGTAAC ACCAGAGGGC 2250
TCATTGTCAG TGATGCTGGC TTTAAAGTGC CATGGTATAA ATCCGTTGAG 2300
AAGCTGGGTT GGTACTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2350
AGACGTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CATGATATGT 2400
CATCTAGTCA CTCAAAGACT TTAGGCTATA AGAGGCTGAC TAAAAGCAAT 2450
CCAATGTCAT GCCAAATTCT ATTGTATAAA TCTGGCTCTA AAGGCCGAAA 2500
AAATCAGCGC TCGACACGGA CTCATTGTCA GGACCCGTCA CCTAAAATCT 2550
ACTCAGCGTC GGCAAAGGAG CCATGGGTTC TAGCAACTAA GTTACGTGTT 2600
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2650
GATTGAAGAA ACCTTCCGAG ACTTGAAAAG TCCTGCCTAC GGACTAGGCC 2700
TAGGCCATAG CCGAACGAGC AGCTCAGAGC GTTTTGATAT CATGCTGCTA 2750
ATCGCCCTGA TGCTTCAACT AACATGTTGG CTTGCGGGCG TTCATGCTCA 2800
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATCGAA 2850
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2900
TACACAATAA CAAGGGAAGA CTTACTCGTG GCTGCAACCC TACTAGCTCA 2950
AAATTTATTC ACACATGGTT ACGCTTTGGG GAAATTATGA TAATGATCCA 3000
GATCACTTCT GGCTAATAAA AGATCAGAGC TCTAGAGATC TGTGTGTTGG 3050
TTTTTTGTGG ATGTGCTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 3100
CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT 3150
TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3200
CTATTCTGGG GGGTGGGGTG GGGCAGCACA GCAAGGGGGA GGATTGGGAA 3250
GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTCTATGG GTACCTCTCT 3300
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCGGTAC CTCTCTCTCT 3350
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 3400
TTGACCCGGT GACCAAAGGT GCCTTTTATC ATCACTTTAA AAATAAAAAA 3450
CAATTACTCA GTGCCTGTTA TAAGCAGCAA TTAATTATGA TTGATGCCTA 3500
CATCACAACA AAAACTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3600
CGTATCCAAG AAGTGATGCC TATCATTGGT TGGAATGAAC TTGAAAAAAA 3650
TTAGCCTTGA ATACATTACT GGTAAGGTAA ACGCCATTGT CAGCAAATTG 3700
ATCCAAGAGA ACCAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3750
GACCCTGAGC AGTGATGAAT CCCCTAATGA TTTTGGTAAA AATCATTAAG 3800
TTAAGGTGGA TACACATCTT GTCATATGAT CCCGGTAATG TGAGTTAGCT 3850
CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT 3900
TGTGTGGAAT TGTGAGCGGA TAACAATTTG ACACAGGAAA CAGCTATGAC 3950
CATGATTACG CCAAGCGCGC AATTAACCCT CACTAAAGGG AACAAAAGCT 4000
GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGC 4050
TGCAGGAATT CGATATCAAG CTTATCGATA CCGCTGACCT CGAGGGGGGG 4100
CCCGGTACCC AATTCGCCCT ATAGTGAGTC GTATTACGCG CGCTCACTGG 4150
CCGTCGTTTT ACAACGTCGT GACTGGGAAA ACCCTGGCGT TACCCAACTT 4200
AATCGCCTTG CAGCACATCC CCCTTTCGCC AGCTGGCGTA ATAGCGAAGA 4250
GGCCCGCACC GATCGCCCTT CCCAACAGTT GCGCAGCCTG AATGGCGAAT 4300
GGAAATTGTA AGCGTTAATA TTTTGTTAAA ATTCGCGTTA AATTTTTGTT 4350
AAATCAGCTC ATTTTTTAAC CAATAGGCCG AAATCGGCAA AATCCCTTAT 4400
AAATCAAAAG AATAGACCGA GATAGGGTTG AGTGTTGTTC CAGTTTGGAA 4450
CAAGAGTGCA CTATTAAAGA ACGTGGACTC CAACGTCAAA GGGCGAAAAA 4500
CCGTCTATCA GGGCGATGGG CCACTACTCC GGGATCATAT GACAAGATGT 4550
GTATCCACCT TAACTTAATG ATTTTTACCA AAATCATTAG GGGATTCATC 4600
AGTGCTCAGG GTCAACGAGA ATTAACATTC CGTCAGGAAA GCTTATGATG 4650
ATGATGTGCT TAAAAACTTA CTCAATGGCT GGTTATGGAT ATGGCAATAC 4700
ATGCGAAAAA CCTAAAAGAG CTTGCCGATA AAAAAGGCCA ATTTATTGCT 4750
ATTTACCGCG GCTTTTTATT GAGCTTGAAA GATAAATAAA ATAGATAGGT 4800
TTTATTTGAA GCTAAATCTT CTTTATCGTA AAAAATGCCC TCTTGGGTTA 4850
TCAAGAGGGT CATTATATTT CGCGGAATAA CATCATTTGG TGACGAAATA 4900
ACTAAGCACT TGTCTCCTGT TTACTCCCCT GAGCTTGAGG GGTTAACATG 4950
AAGGTCATCG ATAGGAGGAT AATAATACAG TAAAACGCTA AACCAATAAT 5000
CCAAATCCAG CCATCCCAAA TTGGTAGTGA ATGATTATAA ATAACAGCAA 5050
ACAGTAATGG GCCAATAACA CCGGTTGCAT TGGTAAGGCT CACCAATAAT 5100
CCCTGTAAAG CACCTTGCTG ATGACTCTTT GTTTGGATAG ACATCACTCC 5150
CTGTAATGCA GGTAAAGCGA TCCCACCACC AGCCAATAAA ATTAAAACAG 5200
GGAAAACTAA CCAACCTTCA GATATAAACG CTAAAAAGGC AAATGCACTA 5250
CTATCTGCAA TAAATCCGAG CAGTACTGCC GTTTTTTCGC CCATTTAGTG 5300
GCTATTCTTC CTGCCACAAA GGCTTGGAAT ACTGAGTGTA AAAGACCAAG 5350
ACCCGTAATG AAAAGCCAAC CATCATGCTA TTCATCATCA CGATTTCTGT 5400
AATAGCAGGA CACCGTGCTG GATTGGCTAT CAATGCGCTG AAATAATAAT 5450
CAACAAATGG CATCGTTAAA TAAGTGATGT ATACCGATCA GCTTTTGTTC 5500
CCTTTAGTGA GGGTTAATTG CGCGCTTGGC GTAATCATGG TCATAGCTGT 5550
TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAA CATACGAGCC 5600
GGAAGCATAA AGTGTAAAGC CTGGGGTGCC TAATGAGTGA GCTAACTCAC 5650
ATTAATTGCG TTGCGCTCAC TGCCCGCTTT CCAGTCGGGA AACCTGTCGT 5700
GCCAGCTGCA TTAATGAATC GGCCAACGCG CGGGGAGAGG CGGTTTGCGT 5750
ATTGGGCGCT CTTCCGCTTC CTCGCTCACT GACTCGCTGC GCTCGGTCGT 5800
TCGGCTGCGG CGAGGGGTAT CAGCTCACTC AAAGGCGGTA ATACGGTTAT 5850
CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC AAAAGGCCAG 5900




GGCGAA&CCC 
TCGCTGGTGC 
CGCCTTTCTC 
GGTATCTCAG 
GAACCCCCCG 
TGAGTCCAAC 
GTAACAGGAT 
A&GTGGTGGG 
CGCTCTGCTG 
GCGGCAAACA 
CAGATTACGC 
TAGGGGGTCT 
TCATGAGATT 
TGAAGTTTTA 
TTAGGAATGC 
GTTCATCGAT 
GAGGGGTTAC 
CTCACCGGCT 
AGCGGAGAAG 
TGTTGCCGGG 
CGTTGTTGCC 
TGGCTTCATT 
CCCATGTTGT 
CAGAAGTAAG 
ATAATTCTCT 
G&GTACTCAA 
CTCTTGCCCG 
TAAAAGTGGT 
ATCTTACCGC 
CTGATCTTCA 
CAGGA&GGCA 

TTATTGTCTC 
AAATAGGGGT 



GGAACGGTAA AAAGGGCGCG 
CCTGACGAGC 



CCGGTGCGCC 



GACAGGACTA 
GCTCTCCTGT 
CCTTGGGGAA 
TT CGGTGTAG 
TTC AGCCCG A 
CCGGT&AGAC AGGACTTATC 
TAG C AG AG CG AGGTATGTAG 
CT A&CTACGG CTACACTAGA 
AAGCCAGTTA CCTTCGGAAA 
AACCACCGCT GGTAGCGGTG 
GCAG&AAAAA AGGATCTCAA 
GACGCTCAGT GGA&CG&AftA 
ATC&AAAAGG ATCTTCACCT 



AATCAATCTA 
TTAATC&GTG 



AAGTATATAT 
AGGCACCTAT 



AGTTGCCTGA CTCCCCGTCG 
CATGTGGCCC GAGTGCTGCA 
CCAGATTTAT CAGCAATAAA 
TGGTCCTGCA ACTTTATCCG 
AAGCTAGAGT AAGTAGTTCG 
ATTGCTACAG GCATCGTGGT 
CAGCTCCGGT TCGCAACGAT 
GCAAAAAAGC GGTTAGCTGC 
TTGGCCGCAG TGTTATCACT 
TACTGTCATG CCATCCGTAA 
CCAAGTCATT CTGAGAATAG 
GCGTCAATAC GGGATAATAC 
CAT G ATTGG A 
TGTTGAGATG 
GCATCTTTTA 



AAATGCCGCA AAAAAGGGAA 
TACTCTTCCr TTTTCAATAT 
ATGAGCGGAT ACATATTTGA 
TCCGCGCACA TTTCCCCGAA 



TTGCTGGCGT 
TCGACGCTCA 
AGGCGTrTCG 
CCGCTTAGCG 
TTCTCATAGC 
CC&AGCTGGG 
TTATCCGGTA 
GCCACTGGCA 
GCGGTGCTAC 
AGGACAGTAT 
AAGAGTTGGT 

GAAGATCGTT 
CTC&CGTTAA 
AGATCGTTTT 
GAGTAAACTT 
CTCAGCGATG 
TGTAGATAAC 
ATGATACCGG 
CCAGCCAGCG 
CGTGGATCCA 
CCAGTTAATA 
GTCACGCTCG 
CAAGGCGAGT 
TTCGGTCGTC 
CATGGTTATG 
GATGCTTTTC 
TGTATGCGGC 
CGCGCCACAT 
CGGGGCGAAA 

CGTTTCTGGG 
TAAGGGCGAC 
TATTGAAGCA 
ATGTATTXAG 
AAGTGCCAC 



TTTTCCATAG 
AGTGAGAGGT 



GATACCTGTC 
TCACGGTGTA 
CTGTGTGCAG 
ACTATGGTGT 
GCAGCCACTG 
AGAGTTGTTG 
TTGGTATCTG 
AGCTCTTGAT 
TTGCAAGCAG 
TGATCTTTTC 
GGGATTTTGG 
AAATTAAAAA 
GGTCTGACAG 
TGTCT ATTT C 
TACGATACGG 
GAGACCCACG 
GGAAGGGCCG 
GTCTATTAAT 
GTTTGGGGAA 
TCGTTTGGTA 
TAC&TGATCC 
CGATCGTTGT 
GCAGCACTGC 
TGTGACTGGT 
GACCGAGTTG 
<m>0S C^^^.CjJ^J^.'G'X"^ 
ACTCTCAAGG 
vst'X'GOi^bv^C^C.&jP^ 
TGAGCAAAAA 
ACGGAAATGT 
TTTATCAGGG 
AAAAATAAAC 



5950
6000
6050
6100
6150
6200
6300
6350
6400
6450
6500
6550
6600
6650
6700
6750
6800
6850
6900
6950
7000
7050
7100
7150
7200
7250
7300
7350
7400
7450
7500
7550
7600
7650
7685
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CTGACGCGCC 
CGCAGCGTGA 
TTTCTTCCCT 
GCCATTGC&T 
CATGTCCAAC 
1 , <&X> AjA^'X^'CT^^hAi 
CO'XTA C A*3T AA 
GCGGCCGATT 
GGGACTTTCC 
CTTGGCAGTA 
TCAATGACGG 
TGGGACTTTC 
GATGGTGATG 
ACTGACGGGG 
TTTTGGGACC 
CCCATTGAGG 
GCAGAGCTCG 
TGTTXTGACC 
GGAACGGTGC 
CCGCCTAXAG 
CTGTTTTTGG 



CTGTAGCGGC 
CCGCTACACT 
TCCTTTCTGG 
ACGTTGTATC 
ATTACCGCCA 
TTACGGGGTC 
CTTACGGTAA 
G AGG X'C A 
ATTGACGTCA 
CATCAAGTGT 
TAAATGGCCC 
CTACTTGGCA 
CGGTTXTGGC 
ATTTGCAAGf 
AAAATCAACG 
CAAATGGGGG 
TTTAGTGAAC 



GCATTAAGCG 
TGGCAGCGGC 
GCAGGTTCGC 
CATATCATAA 
TGTTGACATT 
A.TTAGTTCAT 
ATGGCCCGCC 
ATGACGTATG 
ATGGGTGGAG 
ATCATATGCC 



GTACATCTAC 
AGTACATGAA 
CTCCACCCCA 
GGACTTTCGA 
GTAGGCGTGT 
CGTCAGATGG 



ATTGGAACGC 
ACTCTATAGG 
CTTGGGGCCT 



CGGCGGGTGT 
CTAGGGCCGG 
CGGCATGAGA 
TATGTACATT 
GATTATTGAC 
AGCCCATATA 
TGGCTGACCG 
TTCCCATAGT 
TATTTACGGT 
AAGTACGCCC 
ATGCCCAGTA 
GTATTAGTCA 
TGGGCGTGGA 
TTGACGTCAA 
AAATGTGGTA 
ACGGTGGGAG 
CGTGGAGAGG 
CGATCCAGCG 
TGCCAAGAGT 



ATACACCCCC GCTTCCTT&T 



GGTGGTTACG 
CTCCTTTCGC 
TTGGCTATTG 
TATATTGGCT 
TAGTTATTAA 
TGGAGTTGCG 
CCGAACGACC 
AACGCCAATA 
AAACTGGCCA 
CGTATTGACG 
CATGAGCTTA 
TCGCTATTAC 
TAGCGGTTTG 
TGGGAGTTTG 
ACAAGTCCGG 
GTCTATATAA 
CCATCGACGC 
TCCGCGOCCG 
GACGTAAGTA 
0CATGCTATA 
GCTATAGGTG 



50
100
150
200
250
300
350
400
450
500
550
600
650
700
750
800
850
900
950
1000
1050
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCAGT 1100
CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC ATGGCTCTTT 1150
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTG CTTCAGAGAC 1200
TGACACGGAG TGTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 1250
AATTCAGATA TACAACAACG CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300
CATAGCGTGG GATGTCCACG CGAATCTCGG GTACGTGTTC CGGAGATGGG 1350
CTCTTCTCCG GTAGCGGCGG AGGTTCCACA TCCGAGCGCT GGTCCCATGC 1400
CTGGAGCGGC TCATGGTCGG TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 1450
CCAGACTTAG GCACAGCACA ATGGCGACCA CGACCAGTGT GCCGCACAAG 1500
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550
CACGGCTGAC GCAGATGGAA GAGTTAAGGC AGCGGCAGAA GAAGATGGAG 1600
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650
GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGGTGC 1700
CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750
CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 1800
TTACATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT TAAAACGAGT 1850
CAACAGCTTA ACGTTGGCTT GCCACGCATT ACTTGACTGT AAAACTCTCA 1900
CTGTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGCGAG AACAAAACAT 1950
AACATCAAAC GAATCGACCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2000
GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG CAAAAACGAC 2100
TTATGGTATT GCGAGCTTCA GTCGCACTAC ACGGTCGTTC TGTTACTCTT 2150
TATGAGAAAG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 2200
CCAATTTCTA GGCGACCTTG CGAGCATTCT ACCGAGTAAC ACCACACCGC 2250
TCATTGTCAG TGATGCTGGC TTTAAAGTGC CATGGTATAA ATCCGTTGAG 2300
AAGCTGGGTT GGTACTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2350
AGACCTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CATGATATGT 2400
CATCTAGTCA CTCAAAGACT TTAGGCTATA AGAGGCTGAC TAAAAGCAAT 2450
CCAATCTCAT GGCAAATTCT ATTGTATAAA TCTCGCTCTA AAGGCCGAAA 2500
AAATCAGCGC TCGACACGGA CTCATTGTCA CCACCCGTCA CCTAAAATCT 2550
ACTCAGCGTC GGGAAAGGAG CCATGGGTTC TAGCAACTAA CTTACCTGTT 2600
GAAATTCGAA CACCGAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2650
GATTGAAGAA ACCTTCCGAG ACTTGAAAAG TCCTGCCTAC GGACTAGGCC 2700
TACGCCATAG CCGAACGAGC AGCTCAGAGC GTTTTGATAT CATGCTGCTA 2750
ATCGCCCTGA TGCTTCAACT AACATGTTGG CTTGCGGGCG TTCATGCTCA 2800
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATCGAA 2850
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2900
TACACAATAA CAAGGGAAGA CTTACTCGTG GCTGCAACCC TACTAGCTCA 2950
AAATTTATTC ACACATGGTT ACGCTTTGGG GAAATTATGA TAATGATCCA 3000
GATCACTTCT GGCTAATAAA AGATCAGAGC TCTAGAGATC TGTGTGTTGG 3050
TTTTTTGTGG ATCTGCTGTG CCTTCTAGTT GCCAGGCATC TGTTGTTTGC 3100
CCCTGCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT 3150
TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3200
CTATTCTGGG GGGTGGGGTG GGGGAGCACA GCAAGGGGGA GGATTGGGAA 3250
GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTCTATGG GTAGGTCTCT 3300
CTCTCTCTCT CTCTCTCTCT CTCTCTCTGT CTCTCGGTAC CTCTCTCTCT 3350
CTCTCTCTCT GTCTCTGTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 3400
TTGACCCGGT GACGAAAGGT GCCTTTTATC ATCACTTTAA AAATAAAAAA 3450
CAATTACTCA GTGGCTGTTA TAAGCAGCAA TTAATTATGA TTGATGCCTA 3500
CATCACAACA AAAAGTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3600
CCTATCCAAG AAGTGATGCC TATCATTGGT TGGAATGAAG TTGAAAAAAA 3650
TTAGCCTTGA ATACATTACT GGTAAGGTAA ACGCGATTGT CAGCAAATTG 3700
ATCCAAGAGA AGCAACTTAA AGCTTTCCTG ACGGAATGTT AATTGTCGTT 3750
GACCCTGAGC ACTGATGAAT CCCCTAATGA TTTTGGTAAA AATCATTAAG 3800
TTAAGGTGGA TAGACATGTT GTCATATGAT CCCGGTAATG TGAGTTAGCT 3850
CACTCATTAG GCACCCCAGG CTTTACACTT TATGGTTCCG GCTCGTATGT 3900
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA GAGCTATGAG 3950
CATGATTACG CCAAGCGCGC AATTAACCCT CACTAAAGGG AACAAAAGCT 4000
GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGG 4050
ATCAGATTGG CTATTGGCCA TTGCATAGGT TGTATCCATA TCATAATATG 4100



TACATTTATA 
ATTGACTAGT 
CATATATGGA 
TGACCGCCCA 

A V5'/*>V» V» V? V» W> Vr** 

CATAGTAACG 
TAGGGTAAAG 
ACGGCCCCTA 
CCAGT&CATG 
TAGTCATCGC 
CGTGGATAGC 
CGTCAATGGG 
GTCGTAACAA 
TGGGAGGTCT 
GAGACGCCAT 
CGAGGCTGCG 
AAGAGTGACG 
TCTTATGCAT 
CCTTATGCTA 
ACCATTATTG 
CATAACATGG 
TCTGTCCTTC 
CGATTTATTA 
CGCAGTTTTT 
GTGTTGGGGA 
AGCCCTGGTC 
CTCCTAACAG 
CAGTGTGCCG 
GTGGAGATTG 
GCAGAAGAAG 
GGTAACTCGC 
CAGTACTCGT 
AACAGAGTGT 
AGGGATCCAC 
GGAGTTCATG 
AGTTCGAGAT 
ACCGTGAAGC 
CATGCTGTCC 
CCGCCGACAT 
TGGGAQCOCG 
GGACTCCTCG 
GCGTGAACTT 
TGGGAGGCGT 
CGAGATCCAC 
AGTTCAAGTC 
TACTACGTGG 
CATCGTGGAG 
AGGGGGCGCG 
TTACTTGCTT 
&ATGAATGGA 
ACAAATAAAG 
CTGCATTCTA 
GATACCGCTG 
AGTCGTATTA 
GAAAACCCTG 
CGCCAGCTGG 
AGTTGCGCAG 
TAAAATTCGC 
GCCG&&ATCG 
GTTGAGTGTT 
ACTCGAACGT 
CTCCGGGATC 



TTGGCTGATG 
TATTAATAGT 
GTTCCGCGTT 
ACGACCCCGG 
CCAATAGGGA 
TGCCCACTTG 



A \^\^/r\^t,\^s J t, -4. A. S^> 

AATCAATTAG 
ACATAACTTA 
CCCATTGACG 



GCAGTACATC 



TTGACGTC AA TGACGGTAAA 
ACCTTATGGG ACTTTCCTAC 
TATTACCATG GTGATGCGGT 
GGTTTGACTC ACGGGGATTT 
AGTTTGTTTT GGCACCAAAA 

CTceaeeecA ttgacgcaaa 

ATATAAGCAG AGCTCGTTTA 
CCACGCTGTT 1"£GACCTCCA 
CGGCCGGGAA CGGTGCATTG 
TAAGTACCGC CTATAGACTC 
GOT ATAGTGT TTTTGGCTTG 
TAGGTGATGG TATAGGTTAG 
ACCACTCGCC TATTGGTGAC 
CTCTTTGCCA CAACTATCTC 
AGAGACTGAC ACGG ACT GTG 

mmm ?y j*** X Jy 21. ^T^g' £ v ftf^ft ^]& ? !" ]£t 

ATTAAACATA GCGTGGGATC 
CATGGGCTGT TCTCCGGTAG 
CCATGCCTCC AGCGGCTCAT 
TGGAGGCCAG ACTTAGGCAC 
CACAAGGCCG TGGGGGTAGG 
GGCTCGCACG GCTGACGCAG 
ATGCAGGC AG CTGAGTTGTT 
GTTGCGGTGC TGTTAACGGT 
TGCTGCCGCG CGCGCCACCA 
TCCTTTCCAT GGGT CTTTTC 
CGGTCGCCAC GATGGTGCGC 
CGCTTCA&GG 7GCGC&TGGA 
CGhOGQCGhG GGGGAGGGCC 
TGAAGGTGAG CAAGGGCGGC 
CCCCAG7TCC AGTACGGGTC 
CCCCGACTAG AAGAAGCTGT 
TGATGAACTT CGAGGACGGC 
CTGCAGGACG GCTG GTTCAT 
CCCCTCCGAG GGCCGCGTAA 
CCACCGAGCG CCTGTACCCC 
AAGGCCCTGA AGCTGAAGGA 
CATCTACATG GCCAAGAAGG 
AGTCCAAGCT 
CAGTACGAGC 



GGACATCACC 
GC^GCGAGGG 



ACTCTAGATC ATAATCAGCC 
TAAAAAACCT CCCACACCTC 



ATTGTTGTTG 
CAATAGGATG 



TTAACTTGTT 
A C^AiAAk^X" *i h ^f GjA 



GTTGTGGCCC GGGCTGCAGG 
ACCTCGAGGG GGGGC CCGGT 
CGCGCGCTCA CTGGCCGTCG 
GGGTTACCCA ACXTAATCGC 
CGTAATAGCG AAGAGGCCCG 
CCTGAATGGC GAATGGAAAT 
GTTAAATTTT TGTTAAATCA 
GCAAAATCCC TTATAAATCA 
GTTCCAGTTT GGAACAAGAG 
CAAAGGGGGA AAAACCGTCT 
ATATG&CAAG ATGTGTATCC 



CCGCCATGTT 
GGGGTCATTA 
CGGTAAATGG 
TCAATAATGA 
ACGTCAATGG 
AAGTGTATCA 
TGGCCCGGCT 
TTGGC&GTAC 
TTTGGCAGTA 
CCAAGTCTCC 
TCAACGGGAC 
TGGGGGGTAG 
GTGAACCGTC 
TAGAAGAGAC 
GAACGCGGAT 
TATAGGCACA 
GGGCCTATAG 
CCTATAGGTG 
GATACTTTCC 
TATTGGCTAT 
TATTTTTACA 
ACAACGCCGT 
TCCACGGGAA 
CGGCGGAGCT 

AGCAGAATGC 
GTATGTGTCT 
ATGGAAGACT 
GTATTCTGAT 
GGAGGGCAGT 
GAGATAAT AG 
TGCAGTCACC 
TCCTGCAAGA 
GGGCACCGTG 
GCCCCTAGGA 
CCCCTGCGCT 
CAAGGTGTAG 
CGTTCCCGGA 
GGCGTGGTGA 
CTACAAGGTG 
TGCAGAAGAA 
CGCGACGGCG 
CGGCGGCGAC 
CCGTGCAGCT 
TCCCACAACG 
C GG CGAGCAG 
ATACCACATT 
CCCCTGAACC 
TATTGCAGCT 
GAAATAAAGC 
AATTCGATAT 
ACCCAATTCG 
TTTTACAACG 
CTTGCAGCAG 
CACCGATCGC 
TGTAAGCGTT 
GCTCATTTTT 
AAAGAATAGA 
TCCACTATTA 
ATCAGGGCGA 
ACCTTAACTT 



GACATTGATT 
GTTCATAGCG 
GCCGCCTGGC 
CGTATGTTCC 
GTGGAGTATT 
TATGCCAAGT 
GGCATTATGC 
ATCTACGTAT 
GATCAATGGG 
ACCCCATTGA 
TTTCCAAAAT 
GCGTGTAGGG 
AGATCGGCTG 
CGGGACCGAT 
TCCCCGTGCC 
CCCCTTTGGG 
ACCCCCGCTT 
TGGGTTATTG 
ATTACTAATC 
ATGGCAATAC 
GGATGGGGTC 
CCGCCGTGCC 
TCTCGGGTAC 
TCGACATCCG 
CAGGTCCTTG 

^lC^ \mr AkC» 

GAAAATGAGC 
TAAGGGAGCG 
AAGAGTCAGA 
QTAGTCTGAG 
CTGACAGACT 
GTCTCGCGAC 
ACGTCATGAA 
AAGGGCCACG 
GGGCCACAAC 
TCGCCTGGGA 
GTGAAGCACC 
GGGCTTCAAG 
CCGTGACCCA 
AAGTTCATCG 
GACCATGGGC 
TGCTGAAGGG 
TACCTGGTGG 
GCCCGGCTAC 
AGG ACT AGAC 
CTGTTCGTGT 
TGTAGAGGTT 
TGAAACATAA 
TATAATGGTT 
ATTTTTTTCA 
CAAGCTTATG 
CCCTATAGTG 
TCGTGACTGG 
ATCCCCCTTT 
CCTTCCCAAC 
AATATTTTGT 
TAACCAATAG 
CCGAGATAGG 
AAGAACGTGG 
TGGCCCACTA 
AATGATTTTT 



4150
4200
4250
4300
4350
4400
4450
4500
4550
4600
4650
4700
4750
4800
4850
4900
4950
5000
5050
5100
5150
5200
5250
5300
5350
5400
5450
5500
5550
5600
5650
5700
5750
5800
5850
5900
5950
6000
6050
6100
6150
6200
6250
6300
6350
6400
6450
6500
6550
6600
6650
6700
6750
6800
6850
6900
6950
7000
7050
7100
7150
ACGAAAATCA TTAGGGGATT CATCAGTGCT CAGGGTCAAC GAGAATTAAC 7200
ATTCCGTCAG GAAAGCTTAT GATGATGATG TGCTTAAAAA CTTACTCAAT 7250
GGCTGGTTAT GCATATCGCA ATACATGCGA AAAACCTAAA AGAGCTTGCC 7300
GATAAAAAAG GCCAATTTAT TGCTATTTAC CGCGGCTTTT TATTGAGCTT 7350
GAAAGATAAA TAAAATAGAT AGGTTTTATT TGAAGCTAAA TCTTCTTTAT 7400
CGTAAAAAAT GCCCTCTTGG GTTATCAAGA GGGTCATTAT ATTTCGCGGA 7450
ATAACATCAT TTGGTGACGA AATAACTAAG CACTTGTCTC CTGTTTACTC 7500
CCCTGAGCTT GAGGGGTTAA CATGAAGGTC ATCGATAGCA GGATAATAAT 7550
ACAGTAAAAC GCTAAACCAA TAATCCAAAT CCAGCCATGC CAAATTGGTA 7600
GTGAATGATT ATAAATAACA GCAAACAGTA ATGGGCCAAT AACACCGGTT 7650
GCATTGGTAA GGCTCACCAA TAATCCCTGT AAAGCACCTT GCTGATGACT 7700
CTTTGTTTGG ATAGACATCA CTCCCTGTAA TGCAGGTAAA GCGATCCCAC 7750
CACCAGCCAA TAAAATTAAA ACAGGGAAAA CTAAGGAACC TTCAGATATA 7800
AACGCTAAAA AGGCAAATGG ACTACTATCT GCAATAAATC CGAGCAGTAC 7850
TGCCGTTTTT TCGCCCATTT AGTGGCTATT CTTCCTGCCA CAAAGGCTTG 7900
GAATACTGAG TGTAAAAGAC CAAGAGCCGT AATGAAAAGC CAACCATCAT 7950
GCTATTCATC ATCACGATTT CTGTAATAGC ACCACACCGT GCTGGATTGG 8000
CTATCAATGC GCTGAAATAA TAATCAACAA ATGGCATCGT TAAATAAGTG 8050
ATGTATACCG ATCAGCTTTT GTTCCCTTTA GTGAGGGTTA ATTGCGCGCT 8100
TGGCGTAATC ATGGTCATAG CTGTTTCCTG TGTGAAATTG TTATCCGCTC 8150
ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA AAGCCTGGGG 8200
TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG 8250
CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATCGGCCAA 8300
CGCGCGGGGA GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT 8350
CACTGACTCG CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC 8400
ACTCAAAGGC GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA 8450
AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC 8500
CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCGCCCTGAC GAGCATCACA 8550
AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA 8600
TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC 8650
CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG 8700
CGCTTTCTCA TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT 8750
CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC CCGACCGCTG 8800
CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT 8850
TATGGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT 8900
GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCGTAACT ACGGCTACAC 8950
TAGAAGGACA GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG 9000
GAAAAAGAGT TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC 9050
GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC 9100
TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG 9150
AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC 9200
ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT 9250
ATATGAGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC 9300
CTATCTCAGC GATGTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC 9350
GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG GCCCCAGTGC 9400
TGCAATGATA CCGGGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA 9450
TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA 9500
TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG 9550
TTCGCCAGTT AATAGTTTGC GCAACGTTGT TGCCATTGCT ACAGGCATCG 9600
TGGTGTGACG CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA 9650
CGATCAAGGC GAGTTACATG ATGCCCCATG TTGTGCAAAA AAGCGGTTAG 9700
CTCCTTGGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT 9750
CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC 9800
GTAAGATGCT TTTCTGTGAG TGGTGAGTAC TCAACCAAGT GATTCTGAGA 9850
ATAGTGTATG CGGGGACCGA GTTGCTCTTG CCCGGCGTCA ATACGGGATA 9900
ATACCGCGCC ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACGT 9950
TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC 10000
GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA 10050
CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG 10100
GGAATAAGGG CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA 10150
ATATTATTGA AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT 10200
TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC 10250
CGAAAAGTGC CAC 10263
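
The listings in this appendix are laid out as rows of up to five 10-base groups followed by a running position count, so the final count on the last row equals the sequence length. A minimal sketch of how that counter could be checked against the bases actually present is shown below. The function name and the assumption that the counter is the last whitespace-separated token on each row are illustrative and not part of the specification; the two sample rows are the first two rows of SEQ ID NO:1.

```python
def check_rows(rows):
    """Return (row_index, counted_total, stated_counter) for every inconsistent row.

    Assumes each row holds base groups separated by spaces and ends with the
    running position counter as a single token.
    """
    mismatches = []
    total = 0
    for index, row in enumerate(rows):
        *groups, counter = row.split()
        total += sum(len(group) for group in groups)
        # A counter that is not purely numeric (for example a scanning error
        # such as "SO" for "50") is reported in the same way as a wrong number.
        if not counter.isdigit() or int(counter) != total:
            mismatches.append((index, total, counter))
    return mismatches


sample_rows = [
    "CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 50",
    "GGCAGCGTGA CCGCTACACT TGGGAGCGCC CTAGCGCCCG CTCCTTTCGC 100",
]
mismatches = check_rows(sample_rows)
```

A row whose counter disagrees with the cumulative base count points to a dropped or duplicated base, or to a misread counter, somewhere at or before that row; the check cannot say which base is affected.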



SEQ ID NO:3 (pTnMod (Oval/Red) Chicken)



CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGGGGGTGT GGTGGTTACG 50
CGCAGCGTGA GCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100
TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCATCAGA TTGGCTATTG 150
GCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200
CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 250
TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 300
CGTTACATAA CTTACGGTAA ATGGCGCGCC TGGCTGACCG CCCAACGACC 350
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 400
GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 450
CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG 500
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 550
TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTGA TCGCTATTAC 600
CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 650
ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 700
TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTGCGC 750
GCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGAGG CCATCCACGC 850
TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGGGGGCG 900
GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GACGTAAGTA 950
GCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000
CTGTTTTTGG CTTGGGGCCT ATACAGCCCC GGTTCCTTAT GCTATAGGTG 1050
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100
CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC ATGGCTCTTT 1150
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 1200
TGACACGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 1250
AATTCACATA TACAACAACG CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300
CATAGCGTGG GATCTCCACG CGAATCTCGG GTACGTGTTC CGGACATGGG 1350
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCCT GGTCCCATGC 1400
CTCCAGCGGC TCATGGTCGC TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 1450
CCAGACTTAG GCACAGCACA ATGCCCACCA CCACCAGTGT GCCGCACAAG 1500
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550
CACGGCTGAC GCAGATGGAA GACTTAAGGC AGCGGCAGAA GAAGATGCAG 1600
CCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650
GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC 1700
CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750
CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 1800
TTAGATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT TAAAACGACT 1850
CAACAGCTTA ACGTTGGCTT GCGACGCATT ACTTGACTGT AAAACTCTCA 1900
CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGCGAG AACAAAACAT 1950
AACATCAAAC GAATCGACCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2000
GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG GAAAAAGGAC 2100
TTATGGTATT GCGAGCTTCA GTCGCACTAC AGGGTCGTTC TGTTACTCTT 2150
TATGAGAAAG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 2200
CCAATTTCTA GCCGACCTTG CGAGCATTCT ACCGAGTAAC ACCACACCGC 2250
TCATTGTCAG TGATGCTGGC TTTAAAGTGC CATGGTATAA ATCCGTTGAG 2300
AAGCTGGGTT GGTACTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2350
AGACCTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CATGATATGT 2400
CATCTAGTCA CTCAAAGACT TTAGGCTATA AGAGGCTGAG TAAAAGCAAT 2450
CCAATCTCAT GCCAAATTCT ATTGTATAAA TCTCGCTCTA AAGGCCGAAA 2500
AAATCAGCGC TCGACACGGA CTCATTGTCA CCACCCGTCA CCTAAAATCT 2550
ACTCAGCGTC GGCAAAGGAG CCATGGGTTC TAGCAACTAA CTTACCTGTT 2600
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2650
GATTGAAGAA ACCTTCCGAG ACTTCAAAAG TCCTGCCTAC GGAGTAGGCC 2700
TACGCCATAG CCGAACGAGC AGCTCAGAGC GTTTTGATAT GATGCTGCTA 2750






ATCGCCCTGA TGCTTCAACT AACATGTTGG CTTGCGGGCG TTCATGCTCA 2800
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATCGAA 2850
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2900
TACACAATAA CAAGGGAAGA CTTACTCGTG GCTGCAACCC TACTAGCTCA 2950
AAATTTATTC ACACATGGTT ACGCTTTGGG GAAATTATGA TAATGATCCA 3000
GATCACTTCT GGCTAATAAA AGATCAGAGC TCTAGAGATC TGTGTGTTGG 3050
TTTTTTGTGG ATCTGCTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 3100
CCCTCCCGCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT 3150
TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3200
CTATTCTGGG GGGTGGGGTG GGGCAGCACA GCAAGGGGGA GGATTGGGAA 3250
GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTCTATGG GTACCTCTCT 3300
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCGGTAC CTCTCTCTCT 3350
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 3400
TTGACCCGGT GACCAAAGGT GCCTTTTATC ATCACTTTAA AAATAAAAAA 3450
CAATTACTCA GTGCCTGTTA TAAGCAGCAA TTAATTATGA TTGATGCCTA 3500
CATCACAACA AAAAGTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3600
CCTATCCAAG AAGTGATGCC TATCATTGGT TGGAATGAAC TTGAAAAAAA 3650
TTAGCCTTGA ATACATTACT GGTAAGGTAA ACGCCATTGT CAGCAAATTG 3700
ATCCAAGAGA ACCAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3750
GACCCTGAGC ACTGATGAAT CGCCTAATGA TTTTGGTAAA AATCATTAAG 3800
TTAAGGTGGA TACACATCTT GTCATATGAT CCCGGTAATG TGAGTTAGCT 3850
CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT 3900
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 3950
CATGATTACG CCAAGCGCGC AATTAACCCT CACTAAAGGG AACAAAAGCT 4000
GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGG 4050
AGGTCAGAAT GGTTTCTTTA CTGTTTGTCA ATTCTATTAT TTCAATACAG 4100
AACAATAGCT TCTATAACTG AAATATATTT GCTATTGTAT ATTATGATTG 4150
TCCCTCGAAC CATGAACACT CCTCCAGCTG AATTTCACAA TTCCTCTGTC 4200
ATCTGCCAGG CCATTAAGTT ATTCATGGAA GATCTTTGAG GAACACTGCA 4250
AGTTCATATC ATAAACACAT TTGAAATTGA GTATTGTTTT GCATTGTATG 4300
GAGCTATGTT TTGCTGTATC CTCAGAAAAA AAGTTTGTTA TAAAGCATTC 4350
ACACCCATAA AAAGATAGAT TTAAATATTC CAGCTATAGG AAAGAAAGTG 4400
CGTCTGCTCT TGACTCTAGT CTCAGTTGGC TCCTTCACAT GCATGCTTCT 4450
TTATTTCTCC TATTTTGTCA AGAAAATAAT AGGTCACGTC TTGTTCTCAC 4500
TTATGTCCTG CCTAGCATGG CTCAGATGCA CGTTGTAGAT ACAAGAAGGA 4550
TCAAATGAAA CAGACTTCTG GTCTGTTACT ACAACCATAG TAATAAGCAC 4600
ACTAACTAAT AATTGCTAAT TATGTTTTCC ATCTCTAAGG TTCCCACATT 4650
TTTCTGTTTT CTTAAAGATC CCATTATCTG GTTGTAACTG AAGCTCAATG 4700
GAACATGAGC AATATTTCCC AGTCTTCTCT CCCATCCAAC AGTCCTGATG 4750
GATTAGCAGA ACAGGCAGAA AACACATTGT TACCCAGAAT TAAAAACTAA 4800
TATTTGCTCT CCATTCAATC CAAAATGGAC CTATTGAAAC TAAAATCTAA 4850
CCCAATCCCA TTAAATGATT TCTATGGCGT CAAAGGTCAA ACTTCTGAAG 4900
GGAACCTGTG GGTGGGTCAC AATTCAGGCT ATATATTCCC CAGGGCTCAG 4950
CGGATCTCCA TGGGCTCCAT CGGTGCAGCA AGCATGGAAT TTTGTTTTGA 5000
TGTATTCAAG GAGCTCAAAG TCCACCATGC CAATGAGAAC ATCTTCTACT 5050
GCCCCATTGC CATCATGTCA GCTCTAGCCA TGGTATACCT GGGTGCAAAA 5100
GACAGCACCA GGGAATTCGT GCGCTCCTCC AAGAACGTCA TCAAGGAGTT 5150
CATGCGCTTG AAGGTGCGCA TGGAGGGCAC CGTGAACGGC CACGAGTTCG 5200
AGATCGAGGG CGAGGGCGAG GGCCGCCCCT ACGAGGGCCA CAACACCGTG 5250
AAGCTGAAGG TGACCAAGGG CGGCCCCCTG CCCTTCGCCT GGGACATCCT 5300
GTCCCCCCAG TTCCAGTACG GCTCCAAGGT GTACGTGAAG CACCCCGCCG 5350
ACATCCCCGA GTACAAGAAG CTGTCCTTCC CCGAGGGCTT CAAGTGGGAG 5400
CGCGTGATGA ACTTCGAGGA GGGCGGCGTG GTGACCGTGA CCCAGGACTC 5450
CTCCCTGCAG GAGGGCTGCT TCATCTACAA GGTGAAGTTC ATCGGCGTGA 5500
ACTTCCCCTC CGACGGCCCC GTAATGCAGA AGAAGACCAT GGGCTGGGAG 5550
GCCTCCACCG AGCGCCTGTA CCCCCGCGAC GGCGTGCTGA AGGGCGAGAT 5600
CCAGAAGGCC CTGAAGCTGA AGGACGGCGG CGACTACCTG GTGGAGTTCA 5650
AGTCCATCTA CATGGCGAAG AAGCCCGTGC AGCTGCCCGG CTACTACTAC 5700
GTGGACTCCA AGCTGGACAT CACCTCCCAC AACGAGGACT ACACCATCGT 5750
GGAGCAGTAC GAGCGCACGG AGGGCCGCCA CCACCTGTTC CTGTAGCGGC 5800
CGCGACTCTA GATCATAATC AGCCATACCA GATTTGTAGA GGTTTTACTT 5850
GCTTTAAAAA ACCTCCCACA CCTCCCCCTG AACCTGAAAC ATAAAATGAA 5900
TGCAATTGTT GTTGTTAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT 5950
AAAGCAATAG CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT 6000
TCTAGTTGTG GCTCGAGAAG GGCGAATTCT GCAGATATCC ATCACACTGG 6050
CGGCCGCTCG AGGGGGGGCC CGGTACCCAA TTCGCCCTAT AGTGAGTCGT 6100
ATTACGCGCG CTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC 6150
CCTGGCGTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG 6200
CTGGCGTAAT AGCGAAGAGG CCGGGACCGA TCGCCCTTCG CAACAGTTGC 6250
GCAGCCTGAA TGGCGAATGG AAATTGTAAG CGTTAATATT TTGTTAAAAT 6300
TCGCGTTAAA TTTTTGTTAA ATCAGCTCAT TTTTTAACCA ATAGGCCGAA 6350
ATCGGCAAAA TCCCTTATAA ATCAAAAGAA TAGACCGAGA TAGGGTTGAG 6400
TGTTGTTCCA GTTTGGAACA AGAGTCCACT ATTAAAGAAC GTGGACTCCA 6450
ACGTCAAAGG GCGAAAAACC GTCTATCAGG GCGATGGCCC ACTACTCCGG 6500
GATCATATGA CAAGATGTGT ATCCACCTTA ACTTAATGAT TTTTACCAAA 6550
ATCATTAGGG GATTCATCAG TGCTCAGGGT CAACGAGAAT TAACATTCCG 6600
TCAGGAAAGC TTATGATGAT GATGTGCTTA AAAACTTACT CAATGGCTGG 6650
TTATGCATAT CGCAATACAT GCGAAAAACC TAAAAGAGCT TGCCGATAAA 6700
AAAGGCCAAT TTATTGCTAT TTAGCGCGGC TTTTTATTGA GCTTGAAAGA 6750
TAAATAAAAT AGATAGGTTT TATTTGAAGC TAAATGTTCT TTATCGTAAA 6800
AAATGCCCTC TTGGGTTATC AAGAGGGTCA TTATATTTCG CGGAATAACA 6850
TCATTTGGTG ACGAAATAAC TAAGCACTTG TCTCCTGTTT ACTCCCCTGA 6900
GCTTGAGGGG TTAACATGAA GGTCATCGAT AGCAGGATAA TAATACAGTA 6950
AAACGGTAAA CCAATAATCC AAATCGAGCC ATCCCAAATT GGTAGTGAAT 7000
GATTATAAAT AACAGCAAAC AGTAATGGGC CAATAACACC GGTTGCATTG 7050
GTAAGGCTCA CCAATAATCC CTGTAAAGCA CCTTGCTGAT GACTCTTTGT 7100
TTGGATAGAC ATCACTCCCT GTAATGCAGG TAAAGCGATC CCACCACCAG 7150
CCAATAAAAT TAAAACAGGG AAAACTAACC AACCTTCAGA TATAAACGCT 7200
AAAAAGGCAA ATGCACTACT ATCTGCAATA AATCCGAGCA GTACTGCGGT 7250
TTTTTCGCCC CATTTAGTGG CTATTCTTCC TGCCACAAAG GCTTGGAATA 7300
CTGAGTGTAA AAGACCAAGA CCCGCTAATG AAAAGCCAAC CATCATGCTA 7350
TTCCATCCAA AACGATTTTC GGTAAATAGC ACCGAGACCG TTGCGGGAAT 7400
TTGGCCTATC AATTGCGGTG AAAAATAAAT AATCAACAAA ATGGCATCGT 7450
TTTAAATAAA GTGATGTATA CCGAATTCAG CTTTTGTTCC CTTTAGTGAG 7500
GGTTAATTGC GCGCTTGGCG TAATGATGGT CATAGCTGTT TCCTGTGTGA 7550
AATTGTTATG CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA 7600
GTGTAAAGCC TGGGGTGCCT AATGAGTGAG CTAACTCACA TTAATTGCGT 7650
TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACGTGTGGTG CCAGCTGCAT 7700
TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC 7750
TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 7800
GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA 7850
GGGGATAACG CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG 7900
GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC 7950
CTGACGAGCA TCACAAAAAT CCACGCTCAA GTCAGAGGTG GCGAAACCCG 8000
ACAGGACTAT AAAGATAGGA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG 8050
CTCTCCTGTT CCGACCCTGC GGCTTACCGG ATACCTGTCC GCCTTTCTCC 8100
CTTCGGGAAG CGTGGCGCTT TCTGATAGCT CACGCTGTAG GTATCTCAGT 8150
TCGGTGTAGG TCGTTCGCTC GAAGCTGGGG TGTGTGGACG AACGCCCCGT 8200
TCAGCGGGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC 8250
CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT 8300
AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC 8350
TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 8400
AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA 8450
ACCACCGCTG GTAGGGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG 8500
CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT AGGGGGTGTG 8550
ACGCTCAGTG GAACGAAAAG TCACGTTAAG GGATTTTGGT CATGAGATTA 8600
TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA 8650
ATCAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT 8700
TAATCAGTGA GGCACCTATG TCAGCGATCT GTCTATTTCG TTCATCCATA 8750
GTTGCCTGAC TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC 8800
ATCTGGCCCC AGTGCTGGAA TGATACCGCG AGACCCACGG TCACCGGCTG 8850
CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGGCGA GCGGAGAAGT 8900
GGTCCTGCAA CTTTATCCGC GTCCATCCAG TCTATTAATT GTTGGCGGGA 8950
AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 9000
TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC 9050
AGCTCCGGTT CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG 9100
CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC AGAAGTAAGT 9150
TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT 9200
ACTGTCATGC CATCGGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC 9250
CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 9300
CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC 9350
ATCATTGGAA AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT 9400
GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC TGATCTTCAG 9450
CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA 9500
AATGCGGCAA AAAAGGGAAT AAGGGCGACA GGGAAATGTT GAATAGTCAT 9550
ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA 9600
TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT 9650
CCGCGGACAT TTCCCCGAAA AGTGCCAC 9678



SEQ ID NO:4 (pTnMod (Oval/Red) - Quail)

CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGXGT GGTGGTTACG 50 
CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100 
TTTCTTCCCT TCCTTTCTGG CCACGTTCGC CGGCATCAGA TTGGCTATTG 150 
GCCATXGCAT ACGTTGTATC GATATCATAA TATGTACATT T AT ATTGGCT 200 
CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 250 
TAGTAATCAA TTACGGGGXG ATTAGTXCAX AGCCCATATA TGGAGTTCCG 300 
CGTTACATAA CTTACGGTAA AXGGCCCGCC TGGCTGACCG GGCAACGACC 3 50 
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 4 00 
GGGACTTTCG ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 450 
CTTGGGAGTA GATCAAGTGT ATGAXATGCC AAGTACGCCC CCTATTGACG 500 
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 550 
TGGGAGTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 6 00 
CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 650 
ACTCACGGGG ATTTCCAAGT CTGCACCCCA TTGACGTCAA TGGGAGTTTG 700 
TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 75 0 
CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800 
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC 850 
TGTTTTGACC TGCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 900 
GGAAGGGTGC ATTGGAACGC GQATTCCCCG TGCCAAGAGT GACGTAAGTA 950 
CCGCCTATAG ACXCXATAGG CACACCCCTT TGGCTCTTAT GCATGCTAXA 1000 
CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTTCCTTAT GCTATAGGTG 1050 
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100 
CCCCTATTGG TGACGATACX TTCCATTACT AATCCATAAC ATGGCTCTTT 1150 
GGCACAACTA TCTCTATTGG CXATATGCCA ATACTCTGTC CTTCAGAGAC 1200 
TGACACGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 1250 
AATTCACATA TACAACAACG CCGTCCCCCG TGGCCGCAGT TTTTATTAAA 1300 
CATAGCGTGG GATCTCCACG CGAATCTCGG GTACGTGTTC GGGACATGGG 13 50 
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCCT GGTCCCATGC 14 00 
CTCCAGCGGC TCATGGTCGC TCGGCAGGTC CTTGCXCCTA ACAGTGGAGG 14 50 
GGAGACTTAG GCACAGCACA ATGCCCACCA CCACCAGTGT GCCGCACAAG 1500 
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GhGCGTGGhG ATTGGGCTCG 1550 
CACGGCTGAC GCAGAXGGAA GACTTAAGGC AGCGGCAGAA GAAGATGCAG 1600 
GCAGCTGAGT TGTTGTATTG TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1S50 
GTGGTGTTAA GGGTGGAGGG C&GTGTAGTC TGAGCAGTAC TCGTTGGTGC 1700 
CGCGGGCGCC ACCAGAGATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750 
CCATGGGTCT TTTCTGCAGT CACGGTCGGA CCAXGTGTGA ACTTGATATT 1800 
TTACATGATT CTCTTTAGCA ATTGTGCCCC GAATTACACT TAAAACGACT 1350 
CAACAGGTTA ACGTTGGCTT GCCACGCATT ACTXGACTGT AAAACTCTCA 1900 
CTCTTACCGA ACTTGGCCGT AACCXGCCAA CCAAAGCGAG AACAAAACAX 1950 
AACATCAAAC GAATGGACCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2000
GCGACTCGCT GTATACCGTT CGCATGCTAG CTTTATCTGT TGGGGAATAC 2050
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG CAAAAACGAC 2100
TTATGGTATT GCGAGCTTCA GTCGCACTAC ACGGTCGTTC TGTTACTCTT 2150
TATGAGAAAG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 2200
CCAATTTCTA GCCGACCTTG CGAGCATTCT ACCGAGTAAC ACCACACGGC 2250
TCATTGTGAG TGATGCTGGC TTTAAAGTGC CATGGTATAA ATCCGTTGAG 2300
AAGGTGGGTT GGTACTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2350
AGACGTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CATGATATGT 2400
CATCTAGTCA CTCAAAGACT TTAGGCTATA AGAGGCTGAC TAAAAGCAAT 2450
CCAATCTCAT GCCAAATTCT ATTGTATAAA TGTCGGTCTA AAGGCCGAAA 2500
AAATCAGCGC TCGACACGGA CTCATTGTCA CCACCCGTGA CCTAAAATCT 2550
ACTCAGCGTC GGCAAAGGAG CCATGGGTTC TAGCAACTAA CTTACCTGTT 2600
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGGA 2650
GATTGAAGAA ACCTTCGGAG ACTTGAAAAG TCCTGCCTAC GGACTAGGCC 2700
TACGCCATAG CCGAACGAGC AGCTCAGAGC GTTTTGATAT CATGCTGCTA 2750
ATCGGGGTGA TGCTTCAAGT AACATCTTGG CTTGCGGGCG TTCATGCTCA 2 600 
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATCGAA 2 850 
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2 900 
TACACAATAA CAAGGGAAGA CTTACTCGTG GCTGCAACCC TACTAGCTGA 2 950 
AAATTTATTC ACACATGGTT ACGCTTTGGG GAAATTATGA TAATGATCCA 3000 
GATCACTTCT GGCTAATAAA AGATCAGAGC TCTAGAGATC TGTGTGTTGG 3 050 
TTTTTTGTGG ATGTGCTGTG CCTTCTAGTT GGGAGCCATC TGTTGTTTGG 3100 
CCCTCCCCCG TGCCTTCCTT GACCCTGG AA GGTGCCACTC CCACTGTCCT 3150 
TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 32 00 
CTATTCTGGG GGGTGGGGTG GGGCAGCACA GCAAGGGGGA GGATTGGGAA 3250 
GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTGTATGG GTACCTCTGT 3 3 00 
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCGGTAC CTCTCTCTGT 33 50 
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 34 00 
TTGACCCGGT GACCAAAGGT GCCTTTTATC ATCACTTTAA AAAT AAAAAA 3 4 5 0 
CAATTACTCA GTGCCTGTTA TAAGGAGGAA TTAATTATGA TTG ATG OCT A 3500 
CAT C AC AAC A AAAACTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550 
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3 600 
CCTATCCAAG AAGTGATGCC TATCATTGGT TGGAATGAAC TTGAAAAAAA 3 6 S 0 
TTAGCCTTGA ATACATTACT GGTAAGGTAA ACGGCATTGT CAGCAAATTG 3700 
A7CCAAGAGA ACCAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3 7 SO 
GACCCTGAGC ACTGATGAAT CCCCTAATGA TTTTGGTAAA &ATCATTAAG 30 00 
TTAAGGTGGA TACACATCTT GTCATATGAT CCCGGTAATG TGAGTTAGCT 3S50 
CACTCATTAG GCACCGCAGG CTTTACACTT TATGCTTCGG GCTCGTATGT 3 300 
TGTGTGGAAT TGTGAGCGGA TAACA&TTTC ACACAGGAAA CAGCTATGAC 3 550 
CATGATTACG CCAAGCGCGC AATTAACCCT CACTAA&GGG AACAAAAGGT 4 000 
GGAGCTCCAC CGCGGTGGGG GCCGCTCTAG AACTAGTGGA TCCCCCGGGG 4 050 
AGGTCAGAAT GGTTTCTTTA CTGTTTGT C A ATTCTATTAT TTCAAT ACAG 4 1 0 0 
AACAAAAGCT TCT&TAACTG AAATATATTT GCTATTGT&T ATTATGATTG 4150 
TCCCTCGAAC CATGAACACT CCTCCAGCTG AATTTCACAA TTCCTCTGTC 42 00 
ATCTGCCAGG CTGGAAGATC ATGGAAGATC TCTGAGGAAC ATTGCAAGTT 4 2 50 
CATACCATAA ACTCATTTGG AATTGAGTAT TATTTTGCTT TGAATGGAGC 4300
TATGTTTTGC AGTTCCCTCA GAAGAAAAGC TTGTTATAAA GCGTCTACAC 4350
CCATCAAAAG ATATATTTAA ATATTCCAAC TACAGAAAGA TTTTGTCTGC 4400
TCTTCACTCT GATCTCAGTT GGTTTCTTCA CGTACATGCT TCTTTATTTG 44 50 
CCTATTTTGT CAAGAAAATA ATAGGTCAAG TGGTGTTCTC ACTTATCTCC 4500 
TGCCTAGCAT GGCTTAGATG CACGtTGTAC ATTCAAGAAG GATCAAATGA 45 50 
AACAGACTTC TGG'TCTGTTA GAACAACGAT AGTAATAAAC AGACTAACTA 4600 
ATAATTGCTA ATT ATGTTTT CCATGTGTAA GGTTCCGACA TTTTT CTGTT 4 650 
TTAAGATCGC ATTATCTGGT TGTAACTGAA GCTCAATGGA ACATGAAC AG 4 7 0 0 
TATTTCTGAG TGTTTTCTCC AGCAATCCTG ACGGATTAGA AGAACTGGCA 4 7 SO 
GAAAACACTT TGTTACCCAG AATTAAAAAC TAATATTTGC TCTCCCTTCA 
ATCCAAAATG GACCTATTGA A&CT&AA&TC TGAGGGAATC CCATTAAATT 4850 
ATTTGTATGG CGTCAAAGGT GAAACTTTTG AAGGGAACCT GTGGGTGGGT 4900 
CCCAATTCAG GCTATATATT CCCCAGGGCT CAGCGGATGT CCATGGGCTC 4 950 
CTCGTGCAGC AAGGATGGAA TTTTGCCTTG ATGTATTCAA GGAGCTCAAA 5000 
GTCCACCATG CCAATGACAA CATGCTCTAC TCCCCCTTTG CGATCTGTCA 5050
ACTCTGGCCA TGGTCTCCCT GGGTGCAAAA GACAGCACCA GGGAATTCGT 5100
GCGCTCCTCC AAGAACGTCA TCAAGGAGTT CATGCGGTTC AAGGTGCGCA 5150
TGGAGGGCAC GGTGAACGGC CACGAGTTCG AGATCGAGGG CGAGGGCGAG 5200
GGCGGCCCCT ACGAGGGGCA CAACACCGTG AAGCTGAAGG TGACCAAGGG 5250
CGGCCCCCTG CCCTTCGCCT GGGACATCCT GTCCCCCCAG TTCCAGTACG 5300
GCTCCAAGGT GTACGTGAAG CACCCCGCCG ACATCCCCGA GTACAAGAAG 5350
CTGTCCTTCG CCGAGGGCTT CAAGTGGGAG CGCGTGATGA ACTTCGAGGA 5400
CGGCGGCGTG GTGACCGTGA CCCAGGACTC CTCCCTGCAG GACGGCTGCT 5450
TCATCTACAA GGTGAAGTTC ATCGGCGTGA ACTTCCCCTC CGACGGCCCC 5500
GTAATGCAGA AGAAGACCAT GGGCTGGGAG GCCTCCACCG AGCGCCTGTA 5550
CCCCCGCGAC GGCGTGCTGA AGGGCGAGAT CCACAAGGCC CTGAAGCTGA 5600
AGGACGGCGG CCACTACCTG GTGGAGTTCA AGTCCATCTA CATGGCCAAG 5650
AAGCCCGTGC AGCTGCCCGG CTACTACTAC GTGGACTCCA AGCTGGACAT 5700
CACCTCCCAC AACGAGGACT ACACCATCGT GGAGCAGTAC GAGGGCACCG 5750
AGGGCCGCCA GCACCTGTTC CTGTAGCGGC CGCGACTCTA GATCATAATC 5800
AGCCATACCA CATTTGTAGA GGTTTTACTT GCTTTAAAAA ACCTCCCACA SS50 
CCTCCCCCTG AACCTGAAAC ATAAAATGAA TGCAATTGTT GTTGTTAACT S900 
TGTTTATTGC AGCTTATAAT GGTTACAAAT AAAGCAATAG CATCAC&AAT 595 0 
TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG GCTCGAGAAG 6000 
GGCGAATTCT GCAGATATCC ATCACACTGG CGGCCGCTCG AGGGGGGGCG 6050 
CGGT&CCGAA TTGGCCCTAT AGTGAGTCGT ATTACGCGCG CTCACTGGCC 6100 
GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA CCCAACTTAA 6 ISO 
TCGCCTTGCA GGACATGCCC CTTTCGCGAG CTGGCGTAAT AGCGAAGAGG 6200 
CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG €2 SO 
AAATTGTAAG CGTTAAT&TT TTGTTAAAAT TCGGGTTAAA TTTTTGTTAA 63 00 
ATCAGCTCAT TTTTTAACCA ATAGGCCGAA ATCGG CAAAA TCCCTTATAA 63 50 
ATCAAAAGAA TAGACCGAGA TAGGGTTGAG TGTTGTTCCA GTTTGG&ACA 64 00 
AGAGTCCACT ATTAAAGAAC GTGGACTCCA &CGTCAAAGG GCGAAAAACC 6450 
GTCTATCAGG GGGATGGCCC ACTACTCCGG GATCATATGA CAAGATGTGT 65 00 
ATCCACCTTA ACTTAATGAT TTTTACCAAA ATCATTAGGG GATTGATCAG 6550 
TGCTCAGGGT CAAGGAGAAT TAACATTCCG TCAGGAAAGC TTATGATGAT 66 00 
GATGTGCTTA AAAACTTACT CAATGGCTGG TTATOCATAT CGCAATAGAT 66 SO 
GCGAAAAACC TAAAAGAGCT TGCCGATAAA A&AGGCCAAT TTATTGCTAT 6700 
TTACCGCGGC TTTTTATTGA GCTTGAAAGA TAAATAAAAT AGATAGGTTT 675 D 
TATTTGAAGC TAAATGTTCT TTATCGTAAA A&ATGCCCTC TTGGGTTATC 6 800 
AAGAGGGTCA TTATATTTCG CGGAATAACA TCATTTGGTG ACGAAATAAC 6850 
TAAGCACTTG TCTCCTGTTT ACTCCCCTGA GCTTGAGGGG TTAACATGAA 6900 
GGTCATCGAT AGCAGGATAA TAATACAGTA AAACGCTAAA CCAATAATCG 6950 
AAATGCAGCC ATCCCAAATT GGTAGTGAAT GATTAT&A&T AACAGCAAAC 7000 
AGTAATGGGC CAATAACACC GGTTGGATTG GTA&GGCTCA CCAATAATCG 7 050 
CTGTAAAGCA CCTTGCTGAT GACTCTTTGT TTGGATAGAC ATCACTCCCT 7100 
GTA&TGCAGG TAAAGGGATC CC&CCACCAG CCAATAAAAT TAAAACAGGG 7150 
AAAACTAAGC AAGCTTCAGA TATAAACGCT AAAAAGGCAA ATGCACTACT 72 00 
ATCTGCAATA AATCCGAGCA GTACTGCCGT TTTTTCGCCC CATTTAGTGG 725 0 
CTATTCTTCC TGGCACAAAG GCTTGGAATA CTGAGTGTAA AAGACCAAGA 73 00 
CCCGCTAATG AAAAGCCAAC CATCATGCTA TTCCATCCAA AACGATTTTC 73 SO 
GGTAAATAGC AGCCACAGCG TTGCGGGAAT TTGGCCTATC AATTGCGCTG 74 00 
AAAAATAAAT AATCAACAAA ATGGGATCGT TTTAAATAAA GTGATGTATA 74 50 
CCGAATTGAG CTTTTGTTCC CTTTAGTGAG GGTTAATTGC GCGCTTGGCG 7500 
TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 755 0 
TCCACACAAC ATACGAGCCG GAAG CAT AAA GTGTAAAGGC TGGGGTGCCT 7600 
AATGAGTGAG CTAACTCACA TTAATTGOGT TGCGCTCACT GCCCGCTTTC 7650 
CAGTCGGGAA ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAAGGCGC 7700 
GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC TCGCTCACTG 7750 
ACTCGCTGCG CTCGGTGGTT CGGCTGCGGC GAGCGGTATC AGGTGACTCA 78 00 
AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG CAGGAAAGAA 7 850 
CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAAGGGTAAA AAGGCCGCGT 7 900 
TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT 7 950 
CGAGGCTCAA GTCAGAGGTG GCGAAACCCG &CAGGACTAT AAAGATACCA 8000 
GGCGTTTGCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC BOS0 
CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT 8100 






TCTCATAGCT CACGCTGTAG GmTGTCAGT TCGGTGTAGG TCGTTCGCTC 8150 
CAAGCTGGGC TGTGTGC&CG AACCCCCCGT TCAGCCCGAC CGCTGGGCCT 8200 
TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGAGA CGACTT ATCG 6250 
CCACTGGCAG CAGCCACTGG TJ^CAGGATT AGCAGAGCGA GGTATGTAGG S3 00 
CGGTGGTACA GAGTTCTTG& ^GTGGTGGCC TAACTACGGC TACACTAGAA 8 S S 0 
GGACAGTATT TGGTATCTGC GCTCTGCTGA &GCCAGTT&C CTTCGGA&AA 8400 
AGA&TTGGTA GCTCTTG ATC CGGCAAACAA ACCACGGGTG GT&GCGGTGG 84 SO 
TTTTTTTGTT TGCAAGCAGC AOATTACGCG C AG AA&AAA& GGATCTCA&G 8500 
AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC &5S0 
TCACGTTAAG GGATTTTGGT CATGAGATTA TC A AA AAGG A TCTTCACCTA 8600 
GATCCTTTTA A&TTAAAAAT GAAGTTTTA& ATC AATCTAA AGTATATATG 8650 
&GY A&&CTTG GTCTGACAGT TACCA&TGCT TAATCAGTGA GGCACCTATC 8700 
TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT 8750 
GT AG AT AACT ACGATACGGG AGGGCTTACC ATCTGGCCGC AGTGCTGCAA 8800 
TGATACCGCG AGACCCACGC TCACCGGCTC CAG ATTT ATC AGCAAT AAAC 8 850 
CAGCCAGCCG GAAGGGCCG& GGGCAGAAGT GGTCCTGCAA CTTTATCCGC 8900 
CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC S9S0 
CAGTTAATAG TTTGCGCAAC GTTGTTG CCA TTGCTACAGG CATCGTGGTG 5000 
TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCGGGTT CCC AACGATC 9050 
k&GG CG AGTT ACATGATCCC CGATGTTGTG CAAAAAAGCG GTTAGCTCCT 9100 
TCGGTCCTCC GAT CGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC 91 SO 
ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTOTCATGC CATCCGTAAG 9200 
ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT 9250 
GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTGAATACG GG AT AATACC 9300 
GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC 9350 
GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATGG AGTTGGATGT 94 00 
AACCCAGTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC 9450 
GTTTCTGGGT GAGCAAA&AC AGGAAGGCAA AATGCCGCAA &AAAGGGAAT S5G0 
AAGGGCGACA CGGAAATGTT GAATACTCAT &GTCTTCCTT TTTCA&TATT 9550 
ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTT GAA 9600 
TGTATTTAGA AAAATAAACA AATAGGGGTT CCGGGCACAT TTCCGCGAAA 9650 
AGTGCCAC 9658 



SEQ ID NO:5 (spacer) 
(GPGG)x

SEQ ID NO:6 (spacer)

GPGGGPGGGPGG 

SEQ ID NO:7 (spacer) 

GGGGSGGGGSGGGGS

SEQ ID NO:8 (spacer) 

GGGGSGGGGSGGGGSGGGGS 

SEQ ID NO:9 (enterokinase cleavage site)

DDDDK 

SEQ ID NO:10 (altered transposase Hef forward primer) 

ATCTCGAGACCATGTGTGAACTTGATATTTTACATGATTCTCTTTACC

SEQ ID NO:11 (altered transposase Her reverse primer)

GATTGATCATTATGATAATTTCCCCAAAGCGTAACC 

SEQ ID NO:12 (Xho I restriction site)

CTCGAG 

SEQ ID NO:13 (modified Kozak sequence)
ACCATG 

SEQ ID NO:14 (Bcl I restriction site)
TGATCA 

SEQ ID NO:15 (CMVf-NgoM IV primer)

TTGCCGGCATCAGATTGGGTAT 

SEQ ID NO:16 (Syn-polyAr-BstE II primer)

AGAGGT0ACGGGGTCAATTC7TCAGGACGTGGTA

SEQ ID NO:17 (vitellogenin promoter)

TGAATGTGTT CTTGTGTTAT CAATATAAAT CACAGTTAGT GATGAAGTTG GCTCC&&GCC 
TGCATCAGTT CAGCTACTTG GCTGCATTTT GTATTTGGTT GTGTAGGAAA TGCAAAAGGT 
TCTAGGCTGA CCTGCACTTC TATCCCTCTT GCCTTACTGC TGAGAATCTC TGCAGGTTTf 
AATTGTTCAC ATTTTGCTCC GATTTACTTT GGAAGATAAA ATATTTACAG AATGCTTATG 
AAACCTTTGT TCATTT&AAA ATATTCCTGG TCAGCGTGAC CGGAGCTGAA AGAACACATT 
GATCCCGTGA TTTCAATAAA TAGATATGTT CCATATATTG 7TTCTCAGTA GCCTCTTAAA 
TGATGTGCGT TGGTGCACAT ATGAATACAT GA&TAGCAAA GGTTTATCTG GATTACGCTC 
TGGCGTGCAG GAATGGCGAT AAACCAAAGC TGAGGGAAGA GGGAGAGTAT AGTCAATGTA 
GATTATACTG ATTGCTGATT GGGTTATTAT CAGCTAGATA ACAAGTTGGG TCAGGTG CCA 
GGTCAACATA ACCTGGGCAA AAGCAGTCTC ATCTGTGGCA GGACCATGTA CCAGCAGCCA 
GCCGTGACCC AATCTAGGAA AGCAAGTAGC ACATCAATTT TAAATTTATT GTAAATGGCG 
TAGTAGAAGT GTTTTACTGT GATACATTGA AACTTCTGGT CAATCAGAAA AAGGTTTTTT 
ATCAGAGATG CCAAGGTATT ATTTGATTTT CTTTATTCGC CGTGAAGAGA ATTTATGATT 
GCAAAAAGAG GAGTGTTTAC ATAAACTGAT AAAAAACTTG AGGAATTCAG CAGAAAACAG 
CCACGTGTTC CTGAACATTC TTCCATAAAA GTCTCACCAT GCCTGGC AG A GCCCTATTCA 
CCTTCGCT 



SEQ ID NO:18 (vitellogenin targeting sequence)

ATGAGGGGGATCATACTGGCATTAGTGCTCACCCTTGTAGGCAGCCAGAAGTTTGACATTGGT

SEQ ID NO:19 (p146 protein)
KYKKALKKLAKLL

SEQ ID NO:20 (p146 coding sequence)
AAATACAAAAAAGCACTGAAAAAACTGGCAAAACTGCTG



SEQ ID NO:21 (pro-insulin sequence)

TTTGTGAACCAACACCTGTGCGGCTCACACCTGGTGGAAGCTCTCTACCTAGTGTGCGGGGAACGAGGC
TTCTTCTACACACGCAAGACCCGGCGGGAGGCAGAGGACCTGCAGGTGGGGCAGGTGGAGCTGGGCGGG
GGCCCTGGTGCAGGCAGCCTGCAGCCCTTGGCCCTGGAGGGGTCCCTGCAGAAGCGTGGCATTGTGGAA 
CAATGCTGTACCAGCATCTGCTCCCTCTACCAGCTGGAGAACTCTGCAACTAG 



SEQ ID NO:22 (TAG sequence)



Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp
Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Thr Thr Cys Ile Leu Lys Gly Ser Cys
Gly Trp Ile Gly Leu Leu Asp Asp Asp Asp Lys



SEQ ID NO:23 (gp41 epitope)

Ala Thr Thr Cys Ile Leu Lys Gly Ser Cys Gly Trp Ile Gly Leu Leu



SEQ ID NO:24 (polynucleotide sequence encoding gp41 epitope)

Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Thr Thr Cys Ile Leu Lys Gly
Ser Cys Gly Trp Ile Gly Leu Leu Asp Asp Asp Asp Lys



SEQ ID NO:25 (repeat domain in TAG spacer sequence)
Pro Ala Asp Asp Ala

SEQ ID NO:26 (TAG spacer sequence)

Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp
Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp



SEQ ID NO:27 (Vit pro/Vit targ/TAG/pro-insulin/synthetic polyA)



TGAATGTGTT CTTGTGTTAT CAATATAAAT CAGAGTTAGT GATGAAGTTG GCTGCAAGCC
TGCATCAGTT CAGCTACTTG GCTGCATTTT GTATTTGGTT CTGTAGGAAA TGGAAAAGGT
TGTAGGCTGA CCTGCACTTC TATCCCTCTT GCCTTACTGC TGAGAATCTC TGCAGGTTTT 
AATTGTTCAC ATTTTGCTCC CATTTACTTT GGAAGATAAA ATATTTACAG AATGCTTATG 
AAACCTTTQT TCATTTAAAA ATATTCCTGG TCAGCGTGAC CGGAGCTGAA AGAACACATT 
GATGCCGTGA TTTCAATAAA TACATATGTT CCATATATTG TTTGTCAGT& GCCTCTTAAA 
TCATGTGCGT TGGTGCACAT ATGAAT&CAT GAATAGCAAA GGTTTATCTG GATTACGCTC 
TGGCCTGCAG GAATGGCCAT AAACCAAAGC TG&GGGA&GA GGGAGAGTAT AGTCAATGTA 
GATTATACTG ATTGCTGATT GGGTTATTAT CAGCTAGATA ACAACTTGGG TCAGGTGCCA 
GGTCAACATA AGCTGGGCAA AACCAGTCTC ATCTGTGGCA GGACCATGTA CCAGCAGCCA 
GCCGTGACCC AATCTAGGAA AGCAAGTAGC ACATCAATTT TAAATTTATT GTAAATGCCG 
TAGTAGAAGT GTTTTACTGT GATACATTGA AACTTCTGGT CAATCAGAAA AAGGTTTTTT 
ATCAGAGATG CCAAGGTATT ATTTGATTTT CTTT ATTCG C CGTGAAGAGA ATTTATGATT 
GCAAAAAGAG GAGTGTFTAC ATAAACTGAT AAAAAACrTG AGGAATTCAG CAGAAAACAG 
CCAGGTGTTC CTGAACATTC TTC C ATAAAA GTCTCACCAT GCCTGGCAGA GCCCTATTCA 
CCTTCGCTAT GAGGGGGATC ATACTGGCAT TAGTGCTCAC CCTTGTAGGC AGGCAGAAGT 
TTGACATTGG TAGACTGAGA ATGGCAAGAA GAMG AG AAGA TGGTTTGTG AACCAACACC 
TGTGCGGCTCA CACCTGGTGG AAGCTCTCTA CCTAGTGTGCG GGGAACGAGG CTTCTTCTAC 
ACACCCAAGA CCCGCCGGGA GGCAGAGGAC CTGCAGGTGGG GCAGGTGGAG CTGCGCGGGG 
GCCCTGGTGC AGGCAGGCTG CAGCCCTTGG CCCTGGAGGGG TCCCTGCAGA AGCGTGGCAT 
TGTGGAACAA TGCTGTACCA GO ATCTGCT C CCTCT ACCAGC TGGAGAACTA CTGC&ACTAG 
GGCGCCTGGATCCAGATCACTTCTGGCTAATAAAAGATCAGAGCTCTAGAGATCTGTGTGTTGGTTTTT 
TGTGGATCTGCTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACC 
CHY^AAGGTGCCACT 



CATGCTGGGGATGCGGTGGGCTCTATGGGTACCTCTCTCTCrCTCTCTCTCTCTCTCTCTCIXTCrCTC 
TCTCGGTACCTCTGTC 



SEQ ID NO:28 (synthetic polyA sequence)

GGCGCCTGGATCCAG&XCACTTCTGG 

TGTGGATOTGCrGTGCGTTCTJ^^ 

CTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGG 
TGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGCAmGCAAGGGGGAGGATTGGGAAGACAATAGCAGG 

CATGCTGGGGATGCGGTGGGCTCTATGGGTACCTCTCTCTC 
TGTCGGTACCTCTCTC

SEQ ID NO:29 (pTnMod (Oval/ENT tag/p146/PA) - Chicken)

CTG&CGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 50 
CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100 
TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCATCAGA TTGGCTATTG ISO 
GCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200 
CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTG&C TAGTTATTAA 250 
TAGTAATCAA TTACGGGGTG ATTAGTTCAT AGCCCATATA TGGAGTTCCG 3 00 
CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 3 50 
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 4 00 
GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 4 50 
CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGAGG 500 
TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA SSO 
TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 6 00 
CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGGGGTTTG 650 
ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 7 00 
TTTTGGCACG AA&ATCAAGG GGACTTTCCA AAATGTCGTA ACAACTCCGC 750 
CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800 
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC S50 
TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 9 00 
GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCA&GAGT GACGTAAGTA 950 
CCGCCTATAG ACT CT AT AGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000 
CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTTCCTTAT GCTATAGGTG 105 0 
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100 
CCCCTATTGG TGACGATACT TTCCATTACT AATCCATAAC ATGGCTCTTT 13.50 
GGCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 12 00 
TGACAGGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTAC& IS 50 
AATTCACATA TACAACAACG CCGTGCCCCG TGCCCGCAGT TtTTATTAAA 13 00 
CATAGCGTGG GATCTCCACG CGAATCTCGG GTACGTGTTC CGGACATGGG 13 50 
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCCT GGTCCCATGC 14 00 
CTCCAGCGGC TCATGGTCGC TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 14 50 
CCAGACTTAG GCACAGCACA ATGCCCACCA CCACCAGTGT G CCGCACAAG IS 00 
GCCGTGGCGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550 
CACGGCTGAC GCAGATGGAA GACTTAAGGC AGCGGCAGAA GAAGATGCAG 16 00 
GCAGCTGAGT TGTTGTATTC TGATAAG&GT CAGAGGTAAC TCCCGTTGCG 1650 
GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTTGCTGC 17 00 
CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750 
CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 1800 
TTACATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT TAAAACGACT 1850 
CAACAGCTTA ACGTTGGCTT GCCACGCATT ACTTGAGTGT AAAACTCTCA 1900 
CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGCGAG AACAAAACAT 1950 
AACATCAAAC GAATCGAGCG ATTGTTAGGT AATCGTCACC TCCACAAAGA 2000 
GCGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2 050 
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG CAAAAACGAC 2100 
TTATGGT&TT GCGAGCTTCA GTCGCACTAC ACGGTCGTTC TGTTACTCTT 2150 
TATGAGA&AG CGTTCCCGCT TTCAGAGCAA TGTTCA&AGA AAGCTCATGA 2200 
CCAATTTCTA GGCGACCTTG CGAGCATTCT ACCGAGTAAC ACGACAGCGC 2250 
TCATTGTCAG TGATGCTGGC TTTAAAGTGC CATGGTATAA ATCCGTTGAG 2300 
AAGCTGGGTT GGTAGTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2 3 50 
AGACCTAGGA GCGGAAAACT GGAAAGCTAT CAGCAACTTA CATGATATGT 24 00 
CATCTAGTCA CTCAAAGACT TTAGCCTATA AGAGGCTGAC T AAAAG C AAT 24 50 
CCAATCTCAT GCCAAATTCT ATTGTATAAA TCTCGCTCTA AAGGCCGAAA 2500 
AAATCAGCGC TCGACACGGA CTCATTGTGA CCACCCGTCA CCTAAAATCT 2550 
ACTCAGCGTC GGCAAAGGAG CCATGGGTTC TAGCAACTAA CTTACGTGTT 26 00 
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2650 
GATTGAAGAA ACCTTGCGAG ACTTGAAAAG TCCTGCCTAC GGACTAGGCC 2700 
TACGCCATAG CCGAACGAGC AGCTCAGAGG GTTTTGATAT CATCCTGCTA 2 750 
ATCGCCCTGA TGCTTCAACT AACATGTTGG CTTGCGGGCG TTCATGCTCA 2800 
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAA&TCGAA 285 0 
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2900 
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AAATTTATTC 
GATCACTTCT 
TTTTTTGTGG 
CCCTCCCCCG 
TTCCTAATAA 
CTATTCTGGG 
GACAATAGCA 
CTCTCTCTCT 
CTCTCTCTCT 
TTGACCCGGT 

CrJf^lVXT' J^C» T' Or 
Cb> Ai ^.lT" C^^^?^k Cr> 

TATTTGAACA 
0 CT AT C C AJP^G 
TTAGCCTTGA 
ATCCAAGAGA 
GACCCTGAGC 
TTAftGGTGGA 
CACTCATTAG 
TGTGTGGAAT 
CATGATTACG 
GGAGCTCCAC 
AGGTGAGAAT 
AACAATAGCT 
TCCCTCGAAC 
ATCTGCCAGG 
AGTTCATATG 
GAGCTATGTT 
-J^tdr 

CGTCTGCTCT 
TT&TTTCTCC 
TTATGTCCTG 

ACTAAGTAAT 
TTTCTGTTTF 
GAACATGAGC 
GATTAGCAGA 
TATTTGCTCT 
CeCAATCCCA 
GGAACCTGTG 
CGGATCCATG 
TATTCAAGGA 
CCCATTGCCA 
CAGCACCAGG 
GATTCGGAGA 
TCTTCACTTA 
TTCGTTCAGC 
TGCCAGAATA 
CCTATCAACT 
CTGGGTAGAA 
GCTCCGTGGA 
AAAGGACTGT 
TTTCAGAGTG 
TTGGTTTATT 
GAGCTTCCAT 
TGAAGTCTCA 
TG&CTGAATG 
TACTTACCTC 
AATGGCTATG 
GCATGTCCTC 



CAAGGGAAGA GTTACTGGTG 
ACACATGGTT ACGCTTTGGG 
QGCTAATAAA AGATCAGAGC 
ATCTGCTGTG CCTTCFAJ3TT 
TGCCTTCCTT G ACCCTGG AA 



GGGTGGGGTG 
GGCATGCTGG 
CTCTCTCTCT 
CTCTCTCTCT 
GACCAAAGGT 
GTGCCTGTTA 



GCTGCAACCC 
G AAATTATG A 
TCTAGAGATC 



TTATCTTGAT 



GGATGCGGTG 
CTCTCTCTCT 
CTCTCTCTCT 
GCCTTTT AT C 



AT ACATT ACT GGTA&GGTAA 
ACCAACTTAA* AGCTTTCCTG 
ACTG.^TGAAT CCCCTAATGA 
TACACATCTT GTCATATGAT 



TGTG&GCGGA 
CCAAGCGCGC 
CGCGGTGGGG 
GGTTTCTTTA 
TCTATAACTG 
CATGAACACT 
CCATTAAGTT 
ATAAACACAT 
TTGCTGTATC 
AAAGATAGAT 
TCACTCTAGT 
TATTTTGTCA 
CCT AGC ATGG 
CAGACTTCTG 
AATTGCTAAT 
CTTAAAGATC 
AATATTTCCC 
ACAGGCAGAA 
CCATTGA&TC 
TT AAATG ATT 
GGTGGGTCAC 
GGCTCCATCG 
GCTCAAAGTC 
TCATGTCAGC 
ACAC&GATAA 
CAGT&TTG&A 
GAGACATCCT 
CTTGCC&GTA 
CTTGCAGTGT 



TAACAATTTC 
AAtTT A^A£^ C^ G T" 



rv.- 



CTGTTTGTCA 
AAATATATTT 
CCTCCAGCTG 
ATTCATGGAA 
TTGAAATTGA 



CTC&GTTGCC 

CTCAGATGCA 
GTCTG1TACT 
TATGTTTTCC 
CCATTATCTG 
AGTCTTCTCT 
AACACATTGT 
CAAAATGGAC 



GCGGAGGAAG 

$L ^tirffi^S* CS C** C^r 

TCTAGCCATG 
ATAAGGTTGT 
GCT C AGTGTG 
CAACCAAATC 
GACTTTATGC 
GTGAAGGAAC 



AGTC&GACAA 
TTCTCAAACT 
GGGAGAAAAG 
ACTGAGCAAG 
TAGAGTGGCA 
TTGCCAGTGG 
GGCCTTGAGC 



GCATGAAGAT 



AGCAGAGAGC 



ATGGAATTAT 
GCAATGGTTC 
ATTTAAGGAT 
AAAGCAAACC 
TGAATGGCTT 
GACAATGAGC 
AGCTTGAGAG 



GGTGCCACTC 
TTGTCTGAGT 
GCAAGGGGGA 
GGCTCTATGG 
CTCTCGGTAC 
CGGTACCAGG 

TTAATTATGA 
TTGGTCTGCC 
ATAATAATAA 
TGG AATGAAC 
ACGCCATTGT 
ACGGAATGTT 
TTTTGGTAAA 
CCCGGTAATG 
TATGCTTCCG 
ACACAGGAAA 
GACTAAAGGG 
AAGTAGTGGA 
ATT CT ATT AT 
GCTATTGTAT 
AATTTCACAA 
GATCTTTGAG 
GTATTGTTTT 
AAGTTTGTTA 
CAGCTATAGG 
TCCTTCACAT 
AGGTCACGTC 
CGTTGTAGAT 
ACAACCATAG 
ATCTCTAAGG 
GTTGTAACTG 
CCCATCCAAC 
G ^^CyAiA.T' 
CTATTGAAAC 
CA&AGGTC&A 
ATATATTCCC 
CATGGAATTT 
ATGAGAAGAT 
GTATACCTGG 
TCGCTTTGAT 
GCACATCTGT 
ACCAAAGGAA 
TGAAGAGAGA 
TGTATAGAGG 
GCCAGAGAGC 
CAGAAATGTC 
TGGTT AATG C 
GAAGACACAC 
TGTGCAGATG 
CTGAGAAAAT 
ATGTTGGTGC 
TATAATCAAC 
AAGAGAGGAA 



GGAGGAAAAA 



CTGAAGATAT CTCAAGCTGT 



TACTAGCTCA 2 9 SO 
TAATGATCGA 3 000 
TGTGTGTTGG 3 0SO 
TGTTGTTTGC 3100 
CCACTGTCCT 3150 
AGGTGTCATT 3200 
GGATTGGGAA 3250 
GTACCTCTCT 3 300 
CTCTCTCTCT 3350 
TGCTGAAGAA 34 0 0 
AAATAAAAAA 3450 
TTGATGCCTA 3 5 GO 
TTAGAAAGTA 3 55 0 
AAACCTTATC 3600 
TTGAAAAAAA 3650 
CAGCAAATTG 3700 
AATTCTCGTT 3750 
AATCATTAAG 3 800 
TGAGTTAGCT 3SS0 
GCTCGTATGT 3 900 
CAGCTATGAC 3 950 
AACAAAAGCT 4000 
TCCCCCGGGG 4 0 5 0 
TTCAATACAG 410 0 
ATTATGATTG 4150 
TTCCTCTGTC 4 200 
GAACACTGCA 4250 
GCATTGTATG 4 3 0 0 
TAAAGCATTC 4 350 
AAAGAAAGTG 4 4 0 0 
GCATGGTTCT 44 50 
TTGTTCTCAC 4S00 
ACAAGAAGGA 4 5 SO 
TAATAAGCAC 4S0 0 
TTCCeAGATT 4 6 5 0 
AAGCTCAATG 4 70 0 
AGTCCTGATG 4 7S0 
TAAAAACTAA 4 800 
TAAAATCTAA 4 850 
ACTTGTGAAG 4900 
CAGGGCTCAG 4 95 0 
TGTTTTGATG 5000 
CTTCTACTGC 505 0 
GTGCAAAAGA SX00 
AAACTTCCAG 5150 

AAACGTTCAC 5200 
ATGATGTTTA 5250 
TACCCAATCC 5300 
AGGCTTGGAA 53 50 
TCATCAATTC 54 00 
CTTCAGCCAA 54 5 0 
CATTGTCTTC 5500 
AAGCAATGCC 5550 
ATGTACC AG A 5 6 0 0 
GAAGATCCTG 5650 
TGTTGCCTGA 57 0 0 
TTTGAAAAAC 5750 
GATCAAAGTG 5800 
CATCTGTCTT 
AATCTGTCTG 590 0 
CCATGCAGCA SSS0 






CATGCAGAAA TGAATGAAGC AGGCAGAGAG GTGGTAGGGT CAGCAGAGGC 6000 
TGGAGTGGAT GCTGCAAGCG TCTCTGAAGA ATTTAGGGCT GACCATGCAT 6050 
TCCTCTTCTG TATCAAGCAC ATCGCAACCA ACGCCGTTCT CTTCTTTGGC 6100 
AGATGTGTTT CCCCTCCGCG GCCAGCAGAT GACGCACCAG CAGATGACGC 6150 
ACCAGCAGAT GACGCACCAG CAGATGACGC ACCAGCAGAT GACGCACCAG 6200 
CAGATGACGC AACAACATGT ATCCTGAAAG GCTCTTGTGG CTGGATCGGC 6250
CTGCTGGATG ACGATGACAA AAAATACAAA AAAGCACTGA AAAAAGTGGC 6300
AAAACTGCTG TAATGAGGGC GCCTGGATCC AGATCACTTC TGGCTAATAA 6350 
AAGATCAGAG CTCTAGAGAT CTGTGTGTTG GTTTTTTGTG GATCTGCTGT 6400
GCCTTCTAGT TGCCAGCCAT CTGTTGTTTG CCCCTCCCCC GTGCCTTCCT 6450
TGACCCTGGA AGGTGGCACT CCCACTGTCC TTTCCTAATA AAATGAGGAA 6500 
ATTGCATCGC ATTGTCTGAG TAGGTGTCAT TCTATTCTGG GGGGTGGGGT 6550 
GGGGCAGCAC AGGAAGGGGG AGGATTGGGA AGAGAATAGC AGGCATGCTG 6600 
GGGATGCGGT GGGCTCTATG GGTACCTCTC TCTCTCTCTC TCTCTCTCTC 6650 
TCTCTCTCTC TCTCTCGGTA CCTCTCTCGA GGGGGGGGCC GGTACCCAAT 6700
TCGCCCTATA GTGAGTCGTA TTACGCGCGC TCACTGGCGG TCGTTTTACA 6750 
ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT CGCCTTGCAG 6800 
CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGGACCGAT 6850 
CGCCCTTCCC AACAGTTGCG GAGCCTGAAT GGCGAATGGA AATTGTAAGC 6900 
GTTAATATTT TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT 6950 
TTTTAACCAA TAGGGCGAAA TCGGCAAAAT GCCTTATAAA TCAAAAGAAT 7000
AGACCGAGAT AGGGTTGAGT GTTGTTCCAG TTTGGAACAA GAGTCCACTA 7050 
TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG TCTATCAGGG 7100 
CGATGGCCGA CTACTCCGGG ATCATATGAC AAGATGTGTA TCCACCTTAA 7150 
CTTAATGATT TTTACCAAAA TCATTAGGGG ATTCATCAGT GCTCAGGGTC 7200 
AACGAGAATT AACATTCCGT CAGGAAAGCT TATGATGATG ATGTGCTTAA 7250 
AAACTTACTC AATGGCTGGT TATGCATATC GCAATACATG CGAAAAACCT 7300 
AAAAGAGCTT GCCGATAAAA AAGGCCAATT TATTGCTATT TACCGCGGCT 7350
TTTTATTGAG CTTGAAAGAT AAATAAAATA GATAGGTTTT ATTTGAAGCT 7400
AAATCTTCTT TATCGTAAAA AATGCCCTCT TGGGTTATCA AGAGGGTCAT 7450
TATATTTCGC GGAATAACAT CATTTGGTGA CGAAATAACT AAGCACTTGT 7500 
CTCCTGTTTA CTCCCCTGAG CTTGAGGGGT TAACATGAAG GTCATCGATA 7550 
GCAGGATAAT AATACAGXAA AACGCTAAAC CAATAATCCA AATCCAGCCA 7600 
TCCCAAATTG GTAGTGAATG ATTATAAATA ACAGCAAACA GTAATGGGCC 7650 
AATAACACCG GTTGCATTGG TAAGGCTCAC CAATAATCCC TGTAAAGCAC 7700
CTTGCTGATG ACTCTTTGTT TGGATAGACA TCACTCCCTG TAATGCAGGT 7750 
AAAGCGATCC CACCACCAGC CAATAAAATT AAAACAGGGA AAACTAACCA 7800 
ACCTTCAGAT ATAAACGCTA AAAAGGGAAA TGCACTACTA TCTGCAATAA 7850 
ATCCGAGCAG TACTGCCGTT TTTTCGCCCC ATTTAGTGGC TATTCTTCCT 7900
GCCACAAAGG CTTGGAATAC TGAGTGTAAA AGACCAAGAG CCGGTAATGA 7950
AAAGCCAACC ATCATGCTAT TCCATCGAAA ACGATTTTCG GTAAATAGCA 8000
CCGACACCGT TGCGGGAATT TGGCCTATGA ATTGCGCTGA AAAATAAATA 8050 
ATCAACAAAA TGGCATGGTT TTAAATAAAG TGATGTATAC CGAATTCAGC 8100 
TTTTGTTCCC TTTAGTGAGG GTTAATTGCG CGCTTGGCGT AATCATGGTC 8150 
ATAGCTGTTT CCTGTGTGAA ATTGTTATCC GCTCACAATT CCACACAACA 8200
TACGAGCCGG AAGCATAAAG TGTAAAGCCT GGGGTGCCTA ATGAGTGAGC 8250 
TAACTCACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC AGTCGGGAAA 8300
CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG 8350
GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC 8400 
TCGGTCGTTG GGCTGCGGCG AGCGGTATCA GCTCACTCAA AGGCGGTAAT 8450
ACGGTTATCC ACAGAATCAG GGGATAACGC AGGAAAGAAC ATGTGAGCAA 8500
AAGGCCAGCA AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT 8550 
TTCCATAGGC TCCGCCCCCC TGAGGAGCAT CACAAAAATC GACGCTCAAG 8600 
TCAGAGGTGG CGAAACCGGA CAGGACTATA AAGATACCAG GCGTTTCCCC 8650 
CTGGAAGCTC GCTCGTGGGC TCTCCTGTTC GGAGGGTGCC GCTTACCGGA 8700 
TACCTGTCCG CCTTTCTCCC TTGGGGAAGC GTGGCGCTTT CTGATAGCTC 8750
ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT 8800
GTGTGCACGA ACCCCGCGTT GAGCCCGACC GCTGCGGCTT ATCGGGTAAG 8850 
TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC CACTGGCAGC 8900
AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTAGAG 8950
AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT 9000 
GGTATCTGCG
CTCTTGATGC 
GCAAGCAGCA 
ATCFFTTCTA 
GATTTTGGTC 
A^TAAAAATG 
TGTGACAGTT 
TCTATTTCGT 
CGATACGGGA 
GACCCACGCT 
AAGGGCCGAG 
CTATTAATTG 
TTGCGCAACG 
GTTTGGTATG 
GATGATGCCC 
ATCGTTGTCA 
AGCACTGGAT 
TGAGTGGTGA 
CCGAGTTGGT 
CAGAAGTTTA 
TCTCAAGGAT 
GCAGCCAACT 
AGCAAAAACA 
GGAAATGTTG 
TATGAGGGTT 
AAATAAACAA 



CTCTGCTGAA 
GGG AAAOAAA 
GATTACGCGC 
GGGGGTCTGA 



GCCAGTTACC 
CCACCGCTGG 
AGAAAAAAAG 



TTCGGAAA&& 
TAGCGGTGGT 



i:': 



AAGTTTTAAA 
ACGAATGCTT 
TGATCCATAG 
GGGCTTAGGA 
GACCGGCTCC 

cgcagaagtg 

TTGCCGGGAA 
TTGTTGGCAT 
GCTTCATTGA 
CATGTTGTGG 
GAAGTAAGTT 
AATTCTCTTA 
GTACTCAACC 

AAAGTGCTCA 
CTTACCGCTG 
GATCTTCAGC 
GGAAGGCAAA 
AATACTCATA 
ATTGTGTCAT 
ATAGGGGTTC 



AATCAGTGAG 
TTGCCTGAGT 
TCTGGCCCCA 
AGATTTATCA 
GTCCTGCAAC 



GCTCCGGTTG 



GGCGGCAGTG 
CTGTCATGCC 
A&GTCATTCT 
GTCAATAGGG 
TCA'TTGGAAA 
TTGAGATCCA 
ATGTTTTACT 
ATGCCGCAAA 
CTCTTCCTTT 
G&GCGGATAC 
CGCGCACATT 



AACGAAAACT 
CTTCACGTAG 
GTATATATGA 
GCAGCTATCT 
CGCCGTCGTG 
GTGCTGCAAT 
GCAATAAACC 
TTTATCCGCC 
GTAGTTCGCC 
ATCCTGGTGT 

TTAGCTCCTT 
TTATCACTCA 
ATCCGTAAGA 
G&GAATAGTG 
GATAATACCG 
ACGTTCTfCG 
GTTCGATGTA 
TTCACCAGCG 
AAAGGGA&TA 
TTGAATATTA 
ATATTTG&AT 
TCCCCGAAAA 



GAGTTGGTAG 
TTTTTTGTTT 
AGATCCTTTG 
CACGTTAAGG 
ATCCTTTTAA 
GTAAACTTGG 
GAGCGATCTG 
TAGATAACTA 
GATACCGCGA 
AGCCAGCGGG 
TGGATCCAGT 
AGTTAATAGT 
CAGGCTCGTC 
AGGCGAGTTA 
CGGXCCTGCG 
TGGTTATGGG 
TGCTTTTCTG 
TATGCGGCGA 
CGCCAGATAG 
GGG CGAAAAC 
AGCCACTCGT 
TTTCTGGGTG 
AGGGCGACAG 
1TGAAGCATT 
GTATTTAGAA 
GTGCCAC 



5050 
5100 
9150 
9200 
9250 
9300 

$4 00 
94 SO 

9500 

9550 

9600 

2£50 

97 00 

9750 

9800 

9850 

9900 

9930 

100 00 

1005G 

10100 

1015 0 

xoaoo 

10250 
10297 
CTGACGCGCC
GGCAGCGTGA 
TTTCTTCCCT 
GGCA1 'jCGGAT 
CATGTCCAAC 
TAGT&ATG&A 
CGTTACATAA 
CCCGGCGATT 
GGGAGTTTCC 
CTTGGCAGTA 
TCAATGACGG 
TGGGACTTTC 
CATGGTGATG 
ACTCACGGGG 
TTTTGGCACC 
CGCATTGAGG 
G C AGAGCT GG 
TGTTTTGACC 
GGAAGGGT'GG 
CCGCCTATAG 
CTGTTTTTGG 
ATGGTATAGC 
CGGCTATTGG 
GCCACAACTA 
TGAGAGGGAG 

GATAGGGTGG 
CTCTTCTCCG 
CTC C AGCGG G 
CCAGACTTAG 
G C CGTGG GGG 



CTGTAGCGGG GCATT&AGCG 
CCGCTAGAGT TGCCAGCGCC 
TCCTTTGTCG GCAGGTTCGC 
ACGTTGTATG CATATCATAA 
ATTACCGCCA TGTTGACATT 
TTACGGGGTG ATTAGTTCAT 



CTTAGGGTAA ATGGCCCGCC 



GACGTC A AT A 
ATTGACGTCA 
CATCAAGTGT 




ATCATATGCC 



TAAATGGGGG GCCTGGCATT 
CTACTTGGCA GTACATCTAC 
CGGTTTTGGC AGTAGATCAA 
ATTTCCAAGT CTCCAGCCCA 
AAAATCAACG GGACTTTCCA 
CAAATGGGCG GTAGGCGTGT 
TTT AGTG AAG CGTGAGATCG 
TCCATAGAAG 
ATTGGAACGC 
ACTCTATAGG 
GTTGGGG OCT 
TTAGCCTATA 



AC AC CGGGAC 



CACAGCCCTT 
ATACACCCCC 
GGTGTGGGTT 



TGACGATACT TTCCATTACT 
TCTCTATTGG CTATATGCCA 
TCTGTATt^T TAGAGGATGG 
TACAAGAAGG CCGTCCCCCG 



GCAGAGCAGA ATGC'CCACCA 
TAGGGTATGT GTCTGAAAAT 



CGGCGGGTGT 

A K, 1» 

CGGCATCAGA 
TATGTACATT 
GATTATTGAC 
AGOC'w A .i. AX A 
TGGCTGACGG 
TTCCCATAGT 
TATTTACGGT 
AAGTACGCCC 
ATGCCCAGTA 
GTATTAGTCA 
TGGGCGTGGA 
TTGAGGTGAA 
AAATGTCGTA 
ACGGTGGGAG 
CCTGGAGAGG 
GGATGCAGGC 
TGCCAAGAGT 
TGGCTCTTAT 
GCTTCCTTAT 
ATTGACCATT 
AATCCATAAC 
ATACTCTGTC 
GGTCCCATTT 
TGCCCGCAGT 
GTACGTGTTC 
TCCGAGCCCT 
CTTGCTCCTA 
CCACCAGTGT 
GAGCGTGGAG 



GGTGGTTACG SO 
CTGGTTTCGC 100 
TTGGCTATTG ISO 
TATATTGGCT 200 
TAGTTATTAA 2S 0 
TGGAGTTCCG 3 0 0 
CCCAACGACC 3 S 0 
AACGCCAATA 4 00 
AAACTGCCCA 45 0 
CCTATTGACG 5 00 
CATGAGCTTA SS 0 
TCGCTATTAC 600 
TAGGGGTTTG £50 
TGGGAGTTTG 700 
ACAACTCCGC 750 
GT GT AT AT AA 800 
CCATCCACGC 850 
TCCGCGGCCG 900 
GACGTAAGTA 9S0 
GCATGGTATA 1000 
GCTATAGGTG 10 SO 
ATTGAGGACT 1100 
ATGGCTCTTT 1150 
CTTCAGAGAG 1200 
ATTA^TTAGA 1250 
TTTTATTAAA 1300 
CGGACATGGG 1 35 0 
GGTCCCATGC 1400 
ACAGTGGAGG 1450 
GCCGCACAAG 1500 
ATTGGGCTCG 155 0 
CACGGCFGAC GCAGATGGAA GAGTTAAGGC AGCGGCAGAA GAAGATGCAG 1600
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650
GTGCTGTTAA CGGTGGAGGG CAGTGTAGTC TGAGCAGTAC TCGTXGCTGC 1700
CGCGCGCGCC ACCAGACATA ATAGCTGACA GACTAACAGA CTGTTCCTTT 1750
CCATGGGTCT TTTCTGCAGT CACCGTCGGA CCATGTGTGA ACTTGATATT 1800
TTACATGATT CTCTTTACCA ATTCTGCCCC GAATTACACT TAAAACGACT 1850
CAACAGGTTA ACGTTGGCTT GCCACGCATT ACTTGACTGT AAAACTCTCA 1900
CTCTTACCGA ACTTGGCCGT AACCTGCCAA CCAAAGCGAG AACAAAACAT 1950
AACATCAAAC GAATCGACCG ATTGTTAGGT AATCGTCACG TCCACAAAGA 2000
GGGACTCGCT GTATACCGTT GGCATGCTAG CTTTATCTGT TCGGGAATAC 2050
GATGCCCATT GTACTTGTTG ACTGGTCTGA TATTCGTGAG CAAAAACGAG 2100
TTATGGTATT GCGAGCTTCA GTCGCACTAC ACGGTGGTTC TGTTACTCTT 2150
TATGAGAAAG CGTTCCCGCT TTCAGAGCAA TGTTCAAAGA AAGCTCATGA 2200
CCAATTTCTA GCCGACCTTG CGAGCATTCT ACCGAGTAAC ACCACACCGC 2250
TCATTGTGAG TGATGGTGGC TTTAAAGTGC CATGGTATAA ATCGGTTGAG 2300
AAGCTGGGTT GGTACTGGTT AAGTCGAGTA AGAGGAAAAG TACAATATGC 2350
AGACCTAGGA GCGGAAAACT GGAAACCTAT CAGCAACTTA CATGATATGT 2400
CATCTAGTCA GTCAAAGACT TTAGGCTATA AGAGGCTGAC TAAAAGCAAT 2450
CCAATCTCAT GCCAAATTCT ATTGTATAAA TCTCGCTCTA AAGGCGGAAA 2500
AAATCAGCGG TCGACACGGA CTCATTGTCA CCACCCGTCA CCTAAAATCT 2550
ACTGAGCGTC GGCAAAGGAG CCATGGGTTC TAGCAACTAA CTTACCTGTT 2600
GAAATTCGAA CACCCAAACA ACTTGTTAAT ATCTATTCGA AGCGAATGCA 2650
GATTGAAGAA ACGTTCCGAG AGTTGAAAAG TCCTGCCTAC GGACTAGGCC 2700
TACGCCATAG CCGAACGAGC AGCTCAGAGC GTTTTGATAT CATGCTGGTA 2750
ATCGCCCTGA TGGTXGAACT AACATGTTGG CTTGCGGGCG TTCATGCTCA 2800
GAAACAAGGT TGGGACAAGC ACTTCCAGGC TAACACAGTC AGAAATCGAA 2850
ACGTACTCTC AACAGTTCGC TTAGGCATGG AAGTTTTGCG GCATTCTGGC 2900
TACACAATAA CAAGGGAAGA CTTAGTGGTG GCTGCAACCC TACTAGCTCA 2950
AAATTTATTC ACACATGGTT ACGCTTTGGG GAAATTATGA TAATGATCCA 3000
GATCACTTCT GGCTAATAAA AGATGAGAGC TGTAGAGATC TGTGTGTTGG 3050
TTTTTTGTGG ATCTGCTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 3100
CCCTCCGCCG TGCCTTCCTT GACCGTGGAA GGTGCGACTC CCACTGTCCT 3150
TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3200
CTATTCTGGG GGGTGGGGTG GGGCAGCACA GGAAGGGGGA GGATTGGGAA 3250
GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTCTATGG GTACGTCTCT 3300
CTCTCTCTGT CTCTCTCTCT CTCTCTCTGT CTGTCGGTAC CTCTCTCTGT 3350
CTCTGTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 3400
TTGACCCGGT GAGCAAAGGT GCCTTTTATC ATCACTTTAA AAATAAAAAA 3450
CAATTACTCA GTGCCTGTTA TAAGCAGCAA TTAATTATGA TTGATGCCTA 3500
CATCACAACA AAAAGTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3600
CCTATCCAAG AAGTGATGCC TATCATTGGT TGGAATGAAC TTGAAAAAAA 3650
TTAGCCTTGA ATACATTACT GGTAAGGTAA ACGCCATTGT CAGCAAATTG 3700
ATCCAAGAGA ACGAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3750
GACCCTGAGC ACTGATGAAT CCCCTAATGA TTTTGGTAAA AATCATTAAG 3800
TTAAGGTGGA TACACATCTT GTGATATGAT CCCGGTAATG TGAGTTAGCT 3850
CACTCATTAG GCACCCGAGG CTTTACACTT TATGCTTCCG GCTCGTATGT 3900
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA GAGCTATGAC 3950
CATGATTACG CCAAGGGCGC AATTAACCCT CACTAAAGGG AACAAAAGCT 4000
GGAGCTCCAG CGCGGTGGOG GCCGCTCTAG AACTAGTGGA TCCCGCGGGG 4050
AGGTCAGAAT GGTTTCTTTA CTGTTTGTGA ATTCTATTAT TTCAATACAG 4100
AACAAAAGCT TCTATAACTG AAATATATTT GCTATTGTAT ATTATGATTG 4150
TCCCTCGAAC CATGAACACT CCTCCAGCTG AATTTCACAA TTCCTCTGTC 4200
ATCTGCCAGG CTGGAAGATC ATGGAAGATC TCTGAGGAAC ATTGCAAGTT 4250
CATACCATAA ACTCATTTGG AATTGAGTAT TATTTTGCTT TGAATGGAGC 4300
TATGTTTTGC AGTTGCCTCA GAAGAAAAGC TTGTTATAAA GCGTCTACAG 4350
CCATCAAAAG ATATATTTAA ATATTCGAAC TACAGAAAGA TTTTGTCTGC 4400
TCTTCACTCT GATCTCAGTT GGTTTCTTCA CGTACATGCT TCTTTATTTG 4450
CCTATTTTGT CAAGAAAATA ATAGGTCAAG TCCTGTTCTC ACTTATCTCC 4500
TGCCTAGCAT GGCTTAGATG CACGTTGTAC ATTCAAGAAG GATCAAATGA 4550
AACAGACTTC TGGTCTGTTA CAACAACCAT AGTAATAAAC AGACTAACTA 4600






ATAATTGCTA ATTATGTTTT CCATCTCTAA GGTTCCCACA TTTTTCTGTT 4650 
TTAAGATCCC ATTATCTCGT TGTAACTGAA GCTCAATGGA ACATGAACAG 4700
TATTTCTCAG TCTTTTCTCC AGCAATCCTG ACGGATTAGA AGAACTGGCA 4750 
GAAAAGACTT TGTTACCCAG AATTAAAAAC TAATATTTGC TCTCCCTTCA 4800
ATCCAAAATG GACCTATrGA AACTAAAATC TGACCCAATG eCATTAAATT 4850
ATTTCTATGG CGTCAAAGGT CAAACTTTTG AAGGGAACCT GTGGGTGGGT 4900
CCCAATTCAG GCTATATATT CGCCAGGGCT CAGGGAGTGG ATCCATGGGG 4950
TGCATGGGTG CAGCAAGCAT GGAATT"tTGT TTTGATGTAT TCAAGGAGCT 5000 
CAAAGTCCAC CATGCCAATG ACAACATGCT CTACTGGCCC TTTGCCATGT 5050 
TGTCAACTCT GGCCATGGTC TTCCTAGGTG CAAAAGACAG GAGCAGGACC 5100

CAGATAAATA AGGTTGTTCA CFTTGATAAA CTTCGAGGAT TCGGAGACAG 5150 
TATTGAAGCT CAGTGTGGCA CATCTGTAAA TGTTCACTCT TCACTTAGAG 5200
ACATAGTCAA CCAAATGACC AAACAAAATG ATGCTTATTC GTTCAGCCTT 5250 
GGCAGTAGAC TTTATGCTCA AGAGACATAC ACAGTCGTGC CGGAATACTT 5300
GCAATGTGTG AAGGAAGTGT ATAGAGGAGG CTTAGAATCG GTCAACTTTC 5350
AAACAGCTGC AGATCAAGCC AGAGGGCTCA TCAATGCCTG GGTAGAAAGT 5400
CAGACAAACG GAATTATCAG AAACATCCTT CAGCCAAGGT CCGTGGATTC 5450
TCAAACTGCA ATGGTCCTGG TTAATGCCAT TGGCTTCAAG GGACTGTGGG 5500
AGAAAGCATT TAAGGGTGAA GACACGCAAA CAATACCTTT GAGAGXGAGT 5550
GAGCAAGAAA GCAAACCTGT GCAGATGATG TACCAGATTG GTTCATTTAA 5600 
AGTGGCATCA ATGGCTTCTG AGAAAATGAA GATCGTGGAG GTTCCATTTG 5650

CCAGTGGAAC AATGAGCATG TTGGTGCTGT TGCCTGATGA TGTCTCAGGC 5700
CTTGAGCAGC TTGAGAGTAT AATGAGCTTT GAAAAACTGA CTGAATGGAG 5750
CAGTTCTAGT ATTATGGAAG AGAGGAAGGT CAAAGTGTAC TTACCTGGCA 5800
TGAAGATGCA GGAGAAATAC AACCTGAGAT CTGTCTTAAT GGCTATGGGA 5850 
ATTACTGACC TGTTCAGCTC TTCAGCCAAT CTGTCTGGGA TGTGGTCAGT 5900
AGGGAGCCTG AAGATATCTC AAGCTGTCCA TGCAGCACAT GCAGAAATCA 5950
ATGAAGCGGG CAGAGATGTG GTAGGCTCAG CAGAGGCTGG AGTGGATGCT 6000
ACTGAAGAAT TTAGGGCTGA CCATCCATTC CTGTTCTGTG TCAAGCACAT 6050
CGAAACCAAC GCCATTCTCC TCTTTGGCAG ATGTGTTTCT CCGCGGGCAG 6100
CAGATGACGC ACCAGCAGAT GACGCACCAG CAGATGACGC ACCAGCAGAT 6150 
GACGCACCAG CAGATGACGC ACCAGCAGAT GACGCAACAA CATGTATCCT 6200

GAAAGGCTCT TGTGGCTGGA XGGGCCTGCT GGATGACGAT GACAAAAAAT 6250
ACAAAAAAGC ACTGAAAAAA CTGGCAAAAC TGCTGTAATG AGGGCGCCTG 6300
GATCCAGATC ACTTCTGGCT AATAAAAGAT CAGAGCTCTA GAGATCTGTG 6350
TGTTGGTTTT TTGTGGATCT GCTGTGCCTT CTAGTTGCCA GCCATCTGTT 6400
GTTTGGCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC 6450
TGTCCTTTCC TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT 6500

GTCATTCTAT TCTGGGGGGT GGGGTGGGGC AGCACAGGAA GGGGGAGGAT 6550
TGGGAAGACA ATAGCAGGCA TGCTGGGGAT GCGGTGGGCT CTATGGGTAC 6600 
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCTCT 6650 
CTCGAGGGGG GGCCCGGTAC CCAATTCGCC CTATAGTGAG TCGTATTACG 6700
CGCGCTCACT OQCCGTCGTT TTACAACGTC GTGACTGGGA AAACCCTGGC 6750

GTTACCCAAC TTAATCGCCT TGCAGCACAT CCCCCTTTCG CCAGCTGGCG 6800
TAATAGCGAA GAGGCCCGCA CCGATCGCCC TTCCCAACAG TTGCGCAGCC 6850
TGAATGGGGA ATGGAAATTG TAAGCGTTAA TATTTTGTTA AAATTCGCGT 6900 
TAAATTTTTG TTAAATGAGC TCATTTTTTA ACCAATAGGC CGAAATCGGC 6950
AAAATCCCTT ATAAATCAAA AGAATAGACC GAGATAGGGT TGAGTGTTGT 7000
TCCAGTTTGG AACAAGAGTC CACTATTAAA GAACGTGGAC TCCAACGTCA 7050

AAGGGCGAAA AACGGTCTAT CAGGGCGATG GCCCACTACT CCGGGATCAT 7100 



ATGACAAGAT GTGTATGGAC CTTAACTTAA TGATTTTTAG CAAAATCAXT 7150
AGGGGATTCA TCAGTGCTCA GGGTGAACGA GAATTAACAT TCCGTCAGGA 7200
AAGCCTATGA TGATGATGTG GTTAAAAACT TACTCA^TCG CTGGTTATGC 7250
ATATCGCAAT ACATGCGAAA AAGCTAAAAG AGCTTGCCGA TAAAAAAGGC 7300
GAATTTATTG CXATTTACGG GGGGTTTTTA TTGAGCTTGA AAGATAAATA 7350 
AAATAGATAG GTTTTATTTG AAGCTAAATC TTCTTTATCG TAAAAAATGC 7400 
GCTCTTGGGT TATGAAGAGG GTCATTATAT TTCGCGGAAT AAGATCATTT 7450

GGTGACGAAA TAACTAAGGA CTTGTCTCCT GTTTACTCCC CTGAGCTTGA 7500
GGGGTTAACA TGAAGGTGAT CGATAGCAGG ATAATAATAG AGTAAAAGGC 7550
TAAACCAATA ATCGAAATCC AGCCATCCCA AATTGGTAGT GAATGATTAT 7600



AAATAACAGC AAACAGTAAT GGGCCAATAA CACCGGTTGG ATTGGTAAGG 7650



CTCACCAATA ATCCCTGTAA AGCACCTTGC TGATGACTCT TTGTTTGGAT 7700
AGACATCACT CCCTGTAATG CAGGTAAAGC GATCCCACCA CCAGCCAATA 7750
AAATTAAAAC AGGGAAAACT AACCAACCTT CAGATATAAA CGCTAAAAAG 7800
GCAAATGCAC TACTATCTGC AATAAATCCG AGCAGTAGTG CCGTTTTTTC 7850
GCCCCATTXA GTGGCTATTC TTCCTGCCAC AAAGGCTTGG AATACTGAGT 7900
GTAAAAGACG AAGAGCCGCT AATGAAAAGC CAACCATCAT GCTATTCCAT 7950
CCAAAACGAT TTTCGGTAAA TAGCACCCAC ACCGTTGCGG GAATTTGGCC 8000
TATCAATTGC GCTGAAAAAT AAATAATCAA CAAAATGGCA TCGTTTTAAA 8050
TAAAGTGATG TATACCGAAT TCAGCTTTTG TTCCCTTTAG TGAGGGTTAA 8100
TTGCGCGCTT GGCGTAATCA TGGTCATAGC TGTTTCGTGT GTGAAATTGT 8150
TATCCGCTCA CAATTCCACA GAACATAGGA GCCGGAAGCA TAAAGTGTAA 8200
AGCCTGGGGT GGCTAATGAG TGAGCTAACT CACATTAATT GCGTTGCGCT 8250
CACTGCCCGC TTTCCAGTCG GGAAAGCTGT CGTGCCAGCT GCATTAATGA 8300
ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCTCTTCCGC 8350
TTOCTCGCTC ACTGACTCGC TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG 8400
TATCAGCTCA CTCAAAGGCG GTAATACGGT TATCCACAGA ATGAGGGGAT 8450
AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG 8500
TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 8550 
AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CGCGAGAGGA 8600
CTATAAAGAT ACCAGGCGTT TCCGCCTGGA AGCTGCCTCG TGCGCTCTCC 8650 
TGTTCCGACC CTGCCGCTTA CCGGATACCT GTCCGCCTTT CTCCCTTCGG 8700
GAAGCGTGGC GGTTTCTCAT AGCTCACGCT GTAGGTATCT CAGTTCGGTG 8750 
TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAAGCCC CCGTTCAGCC 8800
CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 8850 
GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA 8900 
GCGAGGTATG TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA 8950 
CGGCTACACT AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAG 9000 
TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCAA ACAAACCACC 9050
GCTGGTAGCG GTGGTTJTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 9100
AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTTGACGCTC 9150
AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA 9200 
AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT 9250 
CTAAAGTATA TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA 9300
GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC 9350 
TGACTCCCCG TCGTGTAGAT AfcCTACGATA CGGGAGGGCT TACCATCTGG 9400
CCCCAGTGCT GGAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT 9450
TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT 9500 
GCAACTTTAT CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG 9550 
AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG CAACGTTGTT GCCATTGCTA 9600 
CAGGCATCGT GGTGTCACGC TCGTCGTTTG GTATGGCTTC ATTCAGGTGC 9650 
GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCGGATGT TGTGCAAAAA 9700 
AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGGGG 9750 
CAGTGTTATC ACTGATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC 9800
ATGCCATGCG TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC 9850 
ATTCTGAGAA TAGTGTATGC GGCGACCGAG TTGGTGTTGG CCGGCGTGAA 9900 
TACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAAGT GCTCATCATT 9950
GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG 10000 
ATCGAGTTCG ATGTAAGCCA CTCGTGCACC CAACTGATCT TCAGCATCTT 10050 
TTACTTTCAG CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC 10100 
GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT 10150
CCTTTTTCAA TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG 10200
GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC 10250
ACATTTCCCC GAAAAGTGCC AC 10272



SEQ ID NO:31 (pTnMod(Oval/ENT tag/Proins/PA) - Chicken)

CTGAGGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTAGG 50 

CGCAGCGTGA CGGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC 100 

TTTCTTCCGT TCCTTTCTCG CCACGTTCGC CGGCATCAGA TTGGCTATTG 150

GCGATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200



CATGTGCAAC 
TAGTAATGAA 

CGCGCCCATT 
GGGAGTTTCC 
CTTGGGAGTA 
TCAATGAGGG 
TGGGAGTTTC 
GATGGTGATG 
AGTCACGGGG 
TTTTGGCACG 
CGCATTGACG 
GCAGAGCTCG 
TGTTTTGACC 
GGAACGGTGC 
CGGGGTATAG 
CTGTTTTTGG 
ATGGTATAGC 
CCCCTATTGG 
GCGAGAACTA 
TGAGACGGAG 

GATAGGGTGG 
CTCTTCTCCG 
GTGGAGCGGG 
GGAGACTTAG 

GACGGCTGAC 
GCAGCTGAGT 
GTGGTGTTAA 

a pace* ca a g 

GCAXGGGTCT 
TTACATGATT 
GAACAGGTTA 

AAGATCAAAC 
GCGACTCGCT 
GATGCGGATT 
TTATGGTATT 
TATGAGAAAG 
CCAATTTCTA 
TCATTGTCAG 
AAGCTGGGTT 
AGAGCTAGGA 
CATCTAGTGA 
GGAATGTCAT 
AAATCAGCGC 
AOT'CAGGG'X'G 
GAAATTCGAA 
GATTGAAGAA 
TAGGGCATAG 
ATCGCCCTGA 
GAAAGAAGGT 
AGGTACTCTG 

AAATTTATTC 
GATCAGTTCT 
TTTTTTGTGG 
CCCTCCCCCG 
TTCCTAATAA 
CTATTCTGGG 



ATTACCGGGA 
TTACGGGGTC 
CTTACGGTAA 
GAGGTGAATA 
ATTGAGGTCA 
CATGAAGTG7 
TAAATGGGGC 
CTAGTTGGCA 
CGGTTTTGGC 
ATTTCCAAGT 
AAAATGAACG 
CAAATGGGGG 
TTTAGTGAAG 
TCCATAGAAG 
ATTGGAACGC 
AGTCTATAGG 
CTTGGGGCCT 
TTAGCGTATA 
TGACGATACT 
TCTCTATTGG 
TCTGTATTTT 
TACAACAAGG 
GATCTCCAGG 
GTAGCGGCGG 
TCATGGTCGC 
GCACAGCACA 
TAGGGTATGT 
GCAG&TGGA& 
TGTTGTATTC 
GGGTGGAGGG 
ACCAGACATA 
TTTCTGCAGT 
CTCTTTACCA 
ACG T TGGCTT 
ACTTGGCCGT 
GAATCGAGCG 
GTATACGGTT 
GTACTTGTTG 
GCGAGCTTCA 
CGTTCCCGCT 
GCCGAGGTTG 
TGATGCTGGC 
GGTACTGGTT 
GCGGAAAACT 
CTCAAAGACT 
GCCAA&TTGT 
TCGACAGGGA 
GGGAAAGGAG 
CACCCAAAGA 



TGTTGACATT 
ATTAGTTCAT 



ATGGGTGGAG 



GCGTGGCATT 
GTACATCTAG 
AGTACATCAA 
CTCCACCCCA 
GGAGTTTGGA 
GTAGGCGTGT 



GACACCCCTT 
ATACACCCCC 
GGTGTGGGTT 
TTCCATTACT 
CTATATGGGA 
TACAGGATGG 



GGAATCTCGG 



TCGGCAGCTG 
ATGCCCACCA 
GTGTGAAAAT 
GAGTTAAGGG 



GAGTGTAGTC 
ATAGCTGACA 
GACGGTCGGA 



GCChCG CATT 

ATTGTTAGGT 
GGCATGCTAG 
ACTGGTGTGA 



TTTAAAGTGC 
AAGTCGAGTA 
GGAAACCTAT 
TTAGGCTATA 
ATTGTATAAA 
CTCATTGTCA 



TGGTTCAAGT 
TGGGAGAAGC 
AACAGTTCGC 
GAAGGGAAGA 
ACACATGGTT 
GGCTAATAAA 
ATCTGGTGTG 
T 



AATGAGGAAA 
GGGTGGGGTG 



AGCTCAGAGC 
AAGATGTTGG 
ACTTCCAGGG 
TTAGGCATGG 
CTTAGTCGTG 
ACGCTTTGGG 
AGATGAGAGC 
CCTTCTAGTT 
GACCCTGGAA 



GATTATTGAC 
AGCCCATATA 
TGGCTGACCG 
TTCCCATAGT 
TATTTACGGT 
^m^^s 'X* A> C» ?3 G G C!j 
ATGCCCAGTA 
GTATTAGTCA 
TGCsGGGTGGA 
TTGAGGTCAA 
AAATGTGGTA 
AGGGTGGGAG 
C CTGG AGAGG 
CGATCGAGCC 
TGCGAAGAGT 
TGGGTCTTAT 
GCTTCCTTAT 
ATTGACGATT 
AJ^'^G G^^T*A.AG 
AT AGTCTGTG 
GGTGGCATTT 
TGCCGGGAGT 
GTACGTGTTG 
TCCGAGGCCT 

CCAGCAGTGT 
GAGCGTGGAG 
AGCGGCAGAA 
CAGAGGTAAG 
TGAGCAGTAG 
GAGTAACAGA 
GGATGTGTGA 
GAATTACACT 
ACTTGACTGT 
G C AAAGGGAG 
AATCGTCAGC 
CTTTATCTGT 
TATTCGTGAG 
AGGGTGGTTC 
TGTTCAAAGA 
AGGGAGTAAG 
GATGGTATAA 
AGAGGAAAAG 
GAG GAAGTT A 
AGAGGCTGAG 
TGTGGCTCTA 
CCACCCGTCA 
TAGCAACTAA 
ATGTATTGGA 
TGCTGGCTAC 
GTTTTGATAT 
GTTGCGGGCG 
TAAGACAGTG 
AAGTTTTGCG 
GCTGCAAGCG 
G AAATTATGA 
'T* G'X'i^.G i^kG.A'l! 6 G 
GGGAGGCATC 
GGTGCCACTG 



GGGGAGCACA GCAAGGGGGA 



TAGTTATTAA 250 
TGGAGTf CCG 300 
GGGAACG AGG 350 
AACGGGAATA 4 00 
AAACTGGCCA 4 S 0 
CCTATTGACG 500 
CATGACCTTA 550 
TCGCTATTAC 600 
TAGCGGTTTG 6 5 0 
TGGGAGTTTG 700 
ACAACTCCGC ISO 
GTCTATATAA 600 
CCATCCACGC 850 
TCCGC6GGGG 900 
GACGTAAGT A 350 
GCATGCTATA 1000 
GCTATAGGTG 10 50 
ATTGACCACT 1100 
ATGGCTCTTT I 15 0 
GTTCAGAGAC 1200 
ATTATTTAGA 1250 
TTTTATTAAA 1300 
GGGAGATGGG 135 0 
GGTGGGATGC 14 00 
ACAGTGGAGG 145 0 
GCCGCACAAG 1500 
ATTGGGCTCG 155 0 
GAAGATGGAG 1600 
TGCGGTTGCG 165 0 
TGGTTGCTGG 17 0 0 
GTGTTCCTTT 175 0 
AGTTGATATT 1800 
TAAAACGAGT 135 0 
AAAACTCTCA 19 0 0 
AAGAAAAGAT 195 0 
TG CACA&AGA 2 000 
TCGGGAATAC 2 050 
GAAAAACGAC 2100 
TGTTAGTGTT 215 0 
AAGGTCATGA 2200 
ACCAGACGGG 2250 
ATGGGTTGAG 23 00 
TACAATATGC 2350 
GATGATATGT 24 00 
TAAAAGCAAT 24 50 
AAGGGCGAAA 2S00 
CGTAAAATCT 2550 
GTTACCTGTT 26 00 
AGCGAATGCA 2650 
GGAGTAGGGG 2700 
CATGCTGGTA 2750 
TTGATGGTGA 2800 
AGAAATCGAA 28 5 0 
GGATTCTGGC 290 0 
TACTAGGTGA 2 550 
TAATGATCG A 3 0 0 0 
TGTGTGTTGG 30 5 0 
TGTTGTTTGC 3100 
CGACTGTGCT 3150 
AGGTGTC ATT 320 0 
GGATTGGGAA 3250 
GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTCTATGG GTACCTCTCT 3300
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCGGTAC CTCTCTCTCT 3350
CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCAGG TGCTGAAGAA 3400
TTGACCCGGT GACCAAAGGT GCCTTTTATC ATCACTTTAA AAATAAAAAA 3450
CAATTACTCA GTGCCTGTTA TAAGCAGCAA TTAATTATGA TTGATGCCTA 3500
CATCACAACA AAAACTGATT TAACAAATGG TTGGTCTGCC TTAGAAAGTA 3550
TATTTGAACA TTATCTTGAT TATATTATTG ATAATAATAA AAACCTTATC 3600
CCTATCCAAG AAGTGATGGC TATCATTGGT TGGAATGAAC TTGAAAAAAA 3650
TTAGCCTTGA ATACATTACT GGTAAGGTAA ACGCCATTGT CAGCAAATTG 3700
ATCCAAGAGA ACCAACTTAA AGCTTTCCTG ACGGAATGTT AATTCTCGTT 3750
GACCCTGAGG ACTGATGAAT CCCCTAATGA TTTTGGTAAA AATCATTAAG 3800
TTAAGGTGGA TACACATCTT GTCATATGAT CCCGGTAATG TGAGTTAGCT 3850
CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT 3900
TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA GAGCTATGAC 3950
CATGATTACG CCAAGCGCGC AATTAACCCT CACTAAAGGG AACAAAAGCT 4000
GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA TCCCCCGGGG 4050
AGGTCAGAAT GGTTTCTTTA CTGTTTGTCA ATTCTATTAT TTCAATACAC 4100
AACAATAGCT TCTATAACTG AAATATATTT GCTATTGTAT ATTATGATTG 4150
TCCGTCGAAC CATGAACACT CCTCCAGCTG AATTTCACAA TTCCTCTGTC 4200
ATCTGCCAGG CCATTAAGTT ATTCATGGAA GATCTTTGAG GAACACTGGA 4250
AGTTCATATC ATAAACACAT TTGAAATTGA GTATTGTTTT GCATTGTATG 4300
GAGCTATGTT TTGCTGTATC CTCAGAAAAA AAGTTTGTTA TAAAGCATTC 4350
ACACCCATAA AAAGATAGAT TTAAATATTC CAGCTATAGG AAAGAAAGTG 4400
CGTCTGCTCT TCACTCTAGT CTCAGTTGGC TCCTTCACAT GCATGCTTCT 4450
TTATTTCTCC TATTTTGTCA AGAAAATAAT AGGTCACGTC TTGTTCTCAC 4500
TTATGTCCTG CGTAGCATGG CTCAGATGCA CGTTGTAGAT ACAAGAAGGA 4550
TCAAATGAAA CAGACTTGTG GTCTGTTACT ACAACCATAG TAATAAGCAC 4600
ACTAACTAAT AATTGCTAAT TATGTTTTCC ATCTCTAAGG TTCCCACATT 4650
TTTCTGTTTT CTTAAAOATC CCATTATCTG GTTGTAACTG AAGGTCAATG 4700
GAACATGAGC AATATTTCCC AGTCTTCTCT CCCATCCAAC AGTGCTGATG 4750
GATTAGGAGA AGAGGCAGAA AACACATTGT TACCCAGAAT TAAAAACTAA 4800
TATTTGCTCT CCATTCAATC CAAAATGGAC CTATTGAAAC TAAAATCTAA 4850
CCCAATCCCA TTAAATGATT TCTATGGCGT CAAAGGTCAA ACTTCTGAAG 4900
GGAACCTGTG GGTGGGTCAC AATTCAGGCT ATATATTCCC CAGGGCTCAG 4950
CGGATCCATG GGCTCCATCG GCGCAGCAAG CATGGAATTT TGTTTTGATG 5000
TATTCAAGGA GCTCAAAGTC CACCATGCCA ATGAGAACAT CTTCTACTGC 5050
CCCATTGCCA TCATGTCAGC TCTAGCCATG GTATACCTGG GTGCAAAAGA 5100
CAGCACCAGG ACAGAGATAA ATAAGGTTGT TCGCTTTGAT AAACTTCCAG 5150
GATTCGGAGA CAGTATTGAA GCTCAGTGTG GCACATCTGT AAACGTTCAC 5200
TCTTCACTTA GAGACATGCT CAACCAAATC ACCAAACCAA ATGATGTTTA 5250
TTCGTTCAGC CTTGGGAGTA GACTTTATGC TGAAGAGAGA TACCCAATCC 5300
TGCCAGAATA CTTGCAGTGT GTGAAGGAAC TGTATAGAGG AGGCTTGGAA 5350
CCTATCAACT TTCAAACAGC TGCAGATCAA GCCAGAGAGC TCATCAATTC 5400
CTGGGTAGAA AGTCAGACAA ATGGAATTAT CAGAAATGTC CTTGAGCCAA 5450
GCTCCGTGGA TTCTCAAACT GCAATGGTTC TGGTTAATGC GATTGTCTTC 5500
AAAGGAGTGT GGGAGAAAAC ATTTAAGGAT GAAGACACAC AAGCAATGCC 5550
TTTCAGAGTG AGTGAGCAAG AAAGCAAACC TGTGCAGATG ATGTACCAGA 5600
TTGGTTTATT TAGAGTGGGA TCAATGGCTT CTGAGAAAAT GAAGATCCTG 5650
GAGCTTCCAT TTGCCAGTGG GACAATGAGC ATGTTGGTGC TGTTGCCTGA 5700
TGAAGTCTCA GGCCTTGAGC AGCTTGAGAG TATAATCAAG TTTGAAAAAC 5750
TGACTGAATG GACCAGTTCT AATGTTATGG AAGAGAGGAA GATCAAAGTG 5800
TACTTACCTC GCATGAAGAT GGAGGAAAAA TACAACCTCA CATCTGTCTT 5850
AATGGCTATG GGCATTACTG ACGTGTTTAG CTCTTCAGCC AATCTCTCTG 5900
GCATCTCCTC AGCAGAGAGC CTGAAGATAT CTCAAGCTGT CCATGCAGCA 5950
CATGCAGAAA TCAATGAAGC AGGCAGAGAG GTGGTAGGGT CAGCAGAGGC 6000
TGGAGTGGAT GCTGCAAGCG TCTCTGAAGA ATTTAGGGCT GACCATCCAT 6050
TCCTCTTCTG TATCAAGCAC ATCGCAACCA ACGCCGTTCT CTTCTTTGGC 6100
AGATGTGTTT CCCGTCCGCG GCCAGCAGAT GACGCACCAG CAGATGACGC 6150
ACCAGCAGAT GACGCACCAG CAGATGACGC ACCAGCAGAT GACGCACCAG 6200
CAGATGACGC AACAACATGT ATCCTGAAAG GCTOTTGTGG CTGGATCGGC 6250 
CTGCTGGATG ACGATGACAA ATTTGTGAAC CAACACCTGT GCGGCTCACA 6300 
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CCTGGTGG&A GCTCTCTACC TAGTGTGCGG GGAACG&GGC TTCTTCTACA 6350 

CACCC&AGAC CCGCGGGGAG GCAGAGGACG TGCAGGTGGG GCAGGTGGAG 64 00 

GTGGGGGGGG GCCCTGGTGC AGGCAGCCTG CAGCCCTTGG CCCTGGAGGG 64 SO 

GTCCCTGCAG AAGCGTGGCA TTGTGGAACA ATGCTGTACC AGCATCTGCT 650© 

GCGTCTACCA GCTGGAGAAC TACTGCAACT AGGGCGCCTG GATCCAGATC 6550 

ACTTCTGGCT AATAAAAGAT C&GAGCXCTA GAGATCTGTG 7GTTGGTTTT 6600 

TTGTGGATCT GCTGTGCCTT CTAGTTGCCA GCC ATCTGTT GTTTGGCCCT 6650 

CCCGCGTGCC TTCCTTGACG CTGGAAGGTG CGAGTCCCAC TGTCCTTTCC 6700 

TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT 6750 

TCTGGGGGGT GGGGTGGGGC AGCACAGCAA GGGGG&GGAT TGGGAAGACA 6800 

ATAGCAGGCA TGCTGGGGAT GCGGTGGGCT CTATGGGTAC CTCTCTCTCT 6050 

CT CTCTCTCT CTCTCTCTCT CTCTCTCTCT CGGTACCTCT CTCGAGGGGG 6 900 

GGCCCGGTAC CCAATXCGGC CXATAGTGAG TCGTATTACG CGCGCTCACT 6950 

GGCCGTCGTT TTACAACGTC GTGACTGGGA AAACCCTGGC GTTACCC&AC 7000 

TTAATCGCCT TGCAGCACAT CGCCCTTTCG CCAGCTGGCG TAATAGCGAA 7050 

GAGGCCCGCA CCGATCGCCC TTGCCAACAG TTGCG CAGCC TGAATGGCGA 7100 

ATGGAAATTG TAAGCGTTAA TATTTTGTTA AAATTCGCGT TAAATTTTTG 7150 

TTAAATCAGC TC&TTTTTTA ACCAATAGGC CGAAATCGGC AAAATCCCTT 72G0 

ATAAATCAAA AGAATAGACC GAGATAGGGT TGAGTGTTGT TCCAGTTTGG 7250 

AACAACAGTC CACTATXAAA GAACGTGGAC TCCAACGTCA AAGGGCG AAA 7300 

AACCGTCTAT CAGGGCGATG GCCCACTACT CCGGGATCAT ATGACAAGAT 7350 

GTGTATCCAC CTTAACTTAft TGATTTTTAC CAAAATCATT AGGGGATTCA 74 00 

TCAGTGCTCA GGGTCAACGA GAATTAACAT TCCGTCAGGA AAG CTT ATG& 74 50 

TGATGATGTG CTTAAAAACT T&GTCAATGG CTGGTTATGC ATATCGCAAT 7500 

AC ATG CG AAA AACCTAAAAG AGCTTGCCGA TAAAAAAGGC CAATTTATTG 7550 

CTATTTACCG CGGCTTTTTA TTGAGCTTGA AAGATAAATA AAATAGATAG 76 00 

GTTTTATTTG AAGCTAAATC TTCTTTATCG TAAAAAATGC CCTCTTGGGT 7 6 5 0 

TATCAAGAGG GTCATTATAT TTCGCGGAAT AACATCATTT GGTGACGAAA 7700 

TAACTAAGCA CTTGTCTCCT GTTTACTCCC CTGAGGTTGft GGGGTTAACA 7750 

TGAAGGTCAT CGATAGCAGG ATAATAATAC AGTAAAAGGC TAAACCAATA 7800 

ATCCAAATCC AGCCATCGGA AATTGGTAGT GAATG&TTAT AAATAACAGC 7 350 

AAACAGTAAT GGGCCAATAA CACCGGTTGC ATTGGTAAGG CTCACCAATA 7900 

ATCCCTGT&A AGCACCTTGC TGATGACTCT TTGTTTGGAT AGACATCACT 7350 

CCCTGTAATG CAGGTAA&GC GATCGCACCA CCAGCCAATA AAAT^AAAAC 8000 

AGGGAAAACT AACCAAGCTT CAGATATAAA CGCTAAAAAG GCAAATGCAC 8050 

TACTATCTGC AAT&AATCCG AGGAGTACTG CCGTTTTTTC GCCCCATTTA 8 i 0 0 

GTGGCTATTC TTCCTGCCAC AAAGGCTTGG AATACTGAGT GTAAAAGACC 8150 

AAGACCCGCT AATGAA&AGC CAAGCATCAT GCTATTCCAT CCAAAACGAT 8200 

TTT CGGT AAA TAGCACCCAC ACCGTTGCGG GAATTTGGCC TATCAATTGC 8250 

GCTGAAAAAT AAATAATCAA CAAAATGGCA TCGTTTT AAA T AAAGTG ATG 8300 

TATACCGAAT TCAGCTTTT G TTCCCTTTAG TGAGGGTTAA TTGCGCGCTT 8350 

GGCGTAATCA TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGGTGA 8400

CAATTCCACA C AA CAT ACGA GGCGGAAGCA TAAAGTGTAA AGCCTGGGGT 8450 

GCCTAATGAG TGAGCTAACT CACATTAATT GCGTTGCGCT CACTGCCCGC 8500 

TTTCCAGTCG GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC 8550

GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCTCTTCCGC TTCCTCGCTC 8600

ACTGACTCGC TGCGCTGGGT GGTTGGGCTG CGGCGAGCGG TATCAGCTCA 8650

CTCAAAGGCG GTAATACGGT TATCCACAGA ATCAGGGGAT AACGCAGGAA 8700 

AGAACATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG TAAAAAGGCC 8750 

GCGTTGCTGG CGTTTTTCCA TAGGGTCCGC CCCCCTGACG AGCATCACAA 8800

AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT 8850 

ACCAGGCGTT TCCCCCTGGA AGCTGGGTCG TGCGCTCTCC TGTTCCGACC 8900

CTGCCGCTTA CCGGATACGT GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC 8950

GCTTTCTCAT AGCTCACGCT GTAGGTATCT CAGTTCGGTG T AGGTCGTTC 9000 

GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC CGACCGCTGC 9050 

GCCTTATCGG GTAACTATCG TCTTGAGTCC AACGCGGTAA GACACGACTT 9100

ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG 9150

TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACAGT 9200

AGAAGGACAG TATTTGGTAT CTGCGCT CTG CTGAAGCCAG TTACGTTCGG 9250 

AAAAAGAGTT GGTAGCTCTT GATCCGGCAA ACAAACCAGC GCTGGTAGCG 9300 

GTGGTTTTTT TGTTTGGAAG GAGCAGATTA CGCGCAGAAA AAAAGGATCT 9350

CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC AGTGGAACGA 9400

AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA 9450

CGTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA 9500

TATGAGTAAA CTTGGTCTGA CAGTTACGAA TGCTTAATCA GTGAGGCACC 9550

TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC TGACTCCCCG 9600

TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG CCCGAGTGCT 9650

GCAATGATAC GGCGAGACCC ACGCTCACCG GCTCCAGATT TATGAGCAAT 9700

AAACCAGCCA GGCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT 9750

CCGCCTCCAT CCAGTCTATT AATTGTTGGC GGGAAGCTAG AGTAAGTAGT 9800

TCGCCAGTTA ATAGTTTGCG CAACGTTGTT GGCATTGGTA CAGGCATCGT 9850

GGTGTCACGC TCGTCGTTTG GTATGGCTTC ATTCAGCTCC GGTTCGCAAC 9900

GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA AGCGGTTAGC 9950

TCCTTGGGTC CTCCGATGGT TGTCAGAAGT AAGTTGGCCG CAGTGTTATC 10000

ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG 10050

TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA 10100

TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA 10150

TACCGCGCCA CATAGCAGAA CTTTAAAAGT GCTCATCATT GGAAAACGTT 10200

CTTCGGGGCG AAAACTCTCA AGGATCTTAG CGCTGTTGAG ATCCAGTTCG 10250

ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT TTACTTTCAC 10300

CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG 10350

GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA 10400

TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT 10450

TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCGGGGC ACATTTCCCC 10500

GAAAAGTGCC AC 10512



SEQ ID NO: 32 (pTnMod(Oval/ENT tag/ProIns/PA) - QUAIL)



CTGACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 50
CGCAGCGTGA CCGCTACACT TGCCAGCGCG GTAGCGCCCG CTCCTTTCGC 100
TTTCTTCCCT TCCTTTCTCG CCACGTTCGC CGGCATCAGA TTGGCTATTG 150
GGCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT TATATTGGCT 200
CATGTCCAAC ATTAGGGGCA TGTTGACATT GATTATTGAC TAGTTATTAA 250
TAGTAATCAA TTACGGGGTG ATTAGTTCAT AGCCCATATA TGGAGTTCCG 300
CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 350
CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 400
GGGAGTTTCG ATTGACGTGA ATGGGTGGAG TATTTACGGT AAACTGCCCA 450
CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG 500
TCAATGACGG TAAATGGGCG GCCTGGCATT ATGCCCAGTA CATGACCTTA 550
TGGGACTTTC GTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 600
CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 650
ACTGACGGGG ATTTCCAAGT CTCCACCCCA TTGAGGTCAA TGGGAGTTTG 700
TTTTGGCAGC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 750
CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 800
GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGAGG GCATCCACGC 850
TGTTTTGAGC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 900
GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GAGGTAAGTA 950
CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 1000
CTGTTTTTGG CTTGGGGCCT ATACAGCCCC GGTTCCTTAT GCTATAGGTG 1050
ATGGTATAGC TTAGCCTATA GGTGTGGGTT ATTGACCATT ATTGACCACT 1100
CCCCTATTGG TGACGATACT TTGCATTACT AATCGATAAG ATGGCTCTTT 1150
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAGAGAC 1200
TGACAGGGAC TCTGTATTTT TACAGGATGG GGTCCCATTT ATTATTTACA 1250
AATTCACATA TACAACAACG CCGTCCCCCG TGCCCGCAGT TTTTATTAAA 1300
CATAGCGTGG GATCTCCACG CGAATCTCGG GTACGTGTTC CGGACATGGG 1350
CTCTTCTCCG GTAGCGGCGG AGCTTCCACA TCCGAGCCGT GGTCCCATGC 1400
CTCCAGCGGC TCATGGTCGC TCGGCAGCTC CTTGCTCCTA ACAGTGGAGG 1450
CCAGACTTAG GGACAGCACA ATGCCCAGCA CCACCAGTGT GCCGCACAAG 1500
GCCGTGGGGG TAGGGTATGT GTCTGAAAAT GAGCGTGGAG ATTGGGCTCG 1550
CACGGCTGAC GCAGATGGAA GAGTTAAGGC AGCGGCAGAA GAAGATGCAG 1600
GCAGCTGAGT TGTTGTATTC TGATAAGAGT CAGAGGTAAC TCCCGTTGCG 1650



GTGCTGTTAA 
CGCGCGCGCC 
CC ATGGGTCT 
TTACATG&TT 
CAACAGCTTA 
CTCTTACCGA 
AACATCAAAC 
GCGACTGGGT 
GATGCCCATT 
TTATGGTATT 
TATGAGAAAG 
CCAATXTCTA 
TCATTGTCAG 
AAGCTGGGTT 
AGACCTAGGA 
CATCTAGTCA 
CCAATCTCAT 
AAATCAGCGC 
ACTCAGCGTC 
GAAATTCGAA 
GATTGAAGAA 
TACGCCATAG 
ATCGCCCTGA 
GAAACA&GGT 
ACGTACTCTC 
T A C ACT AAT A^f-t 
AAATTTATTC 
GATCACTTCT 
TTTTTTGTGG 

CTATTCTGGG 
GACAATAGCA 
CTCTCTCTCT 
CTCTCTCTCT 
TTGACGCGGT 
CAATTACTCA 
CATCACAACA 
TATTTGAACA 
CCTAT GC AAG 
TTAGCCTTGA 
ATCCAAGAGA 
GACCCTGAGC 
XTAAGGTGGA 
CACTCATTAG 
TGTGTGGAAT 
CATGATTACG 
GGAGCTCCAC 
AGGTCAG&AT 
AAC AA SAG CT 
TCCCTCGAAC 
ATCTGCCAGG 
C<?VT A C v» jA1£*^^A 
TATGTTTTGC 
CCATC^AAAG 
TCTTCACTCT 
CCTAT fTTGT 
TGCCTAGC&T 
AACAGACTTC 
ATAATTGCTA 
TTA&GATCCC 



CGGTGGAGGG 
ACCAGACATA 
TTTCTGCAGT 



GAGTGTAGTC 
AT AG CTG ACA 



ACTTGGCCGT 

GAATCGACCG 

GTATACCGTT 

GTACTTGTTG 

GCGAGCTTCA 

CGTTCCCGCT 

GCCGACCTTG 

TGATOCTGGC 

GGTACTGGTT 

GCGGAA&ACT 

CTCAAAGACT 

GCCAAATTCT 

TCGACACGGA 

GGCAAAGGAG 

CACCCAftACA 

ACCTTCCGAG 

CCGAACGAGC 

TGCTTCAACT 

TGGGACAAGC 

AACAGTTCGC 

GAAGGGAAGA 

AC AC &TGGTT 

GGCTAATAAA 

ATCTGCTGTG 

TGCCTTCCTT 

AATGAGGAAA 

GGGTGGGGTG 

GGCATGCTGG 

CTCTCTCTCT 

CTCTCTCTCT 

GACCAAAGGT 



AACCTGCCAA 
ATTGTTAGGT 
GGCATGCTAG 
ACTGGTCTGA 
GTCGCACTAC 
TTCAGAGCAA 



TTTAAAGTGC 



AAAACTGATT 
TTATCTTGAT 
AAGTGATGCC 
AT AC ATT ACT 
ACCAACTTAA 
ACTGATGAAT 
TACACATCTT 
GCACCCCAGG 
TGTGAGCGGA 
CCAAGGGGGC 
CGCGGTGGGG 
GGTTTCTTTA 
TCTATAACTG 
CATGAACACT 
CTGGAkGATG 
ACTCATTTGG 
&GTTCCCTCA 
ATATATTTAA 
G&TCTCAGTT 



GGCTTAGATG 



ATTATGTTTT 
ATTATCTGGT 



TGAGCAGTAC 
C5.A CJ^&'AC AG A 
CCATGTGTGA 
GAATTACACT 
ACTTGACTGT 
CC&AAGCGAG 
AATCGTCACC 
CTTTATCTGT 
TATTCGTGAG 
ACGGTCGTTC 
TGTTC&AAGA 
ACCGAGTAAC 
CATGGTATAA 
AGAGGAAA&G 
CAGCAACTTA 
AGAGGCTGAC 
TCTCGCTCTA 
CCACCCGTC& 
TAGCAACTAA 
ATCTATTCGA 
TCCTGCCTAC 
GTTTTGATAT 
CTTGCGGGCG 
TAACACAGTC 
AAGTTTTGCG 
GCTGCAACCC 
GAAATTATGA 
TCTAG&GATC 
GCCAGCCATC 
GGTGCCACTC 
TTGTCTGAGT 
GCAAGGGGQA 
GGCTCTATGG 
CTCTCGGTAC 
CGGTACCAGG 
ATCACTTTAA 
TTAATTATGA 
TTGGTGTGCC 

TGGAATGAAC 

ACGCCATTGT 

ACGGAATGTT 

TTTTGGTAAA 

CCCGGTAATG 

TATGCTTCCG 

AGAGAGGAAA 

C&CTAA&GGG 

AACTAGTGGA 

ATTCTATTAT 

GGTATTGTAT 
AATTTCAGAA 

TCTGAGG&&C 
TATTTTGCTT 
TTGTTATA&A 
TACAGAAAGA 
CGTACATGCT 
ATAGGTCAAG TCCTGTTCTC 



TCGTTGCTGC 
CTGTTCCTTT 
ACTTGATATT 
TAAAACGACT 
Ap<j j ^A^>C"T C TC A. 



GGAAACCTAT 
TTAGGCTATA 
ATTGTATAAA 
CTCATTGTCA 
CCATGGGTTC 
ACTTGTTAAT 
ACTTGAAAAG 
AGCTCAGAGC 
AACATGTTGG 
AGTTCCAGGC 
TTAGGCATGG 
CTTACTCGTG 
ACGCTTTGGG 
AGATCAGAGC 
CCTTCTAGTT 
GACCCTGGAA 
TTGCATCGCA 
GGGCAGCACA 
GGATGCGGTG 
CTCTCTCTCT 
CTCTCTCTCT 
GCCTTTTATC 
TAAGCAGCAA 
TAACAAATGG 
TATATTATTG 
TATGATTGGT 
GGTAAGGTAA 
AGCTTTCCTG 
CCCCTAATGA 
GTCATATGAT 
CTTTACACTT 
TAACAATTTC 
AATTAftvCCCT 
G CCGCTCT AG 
CTGTTTGTCA 
AAATATATTT 
CCTCCftGCTG 
ATGGAAGATC 
AATTGAGTAT 
GAAGAAAAGC 
ATATTCCAAC 
GGTTTCTTCA 



CACGTTGTAC 




CCATCTCTAA GGTTCCCACA 
TGTAACTGAA GCTCAATGGA 



TGCACAAAGA 
TCGGGAATAC 
CAAAAACGAC 
TGTTACTCTT 
AAGCTCATG^ 
C!?J^i^>D«^^^ur CC3 G 
ATCCGTTGAG 
TA.CAATATGC 
CATGATATGT 
TAAAAGCAAT 
AAGGCCGAAA 
CCTAAAATCT 
CTTACCTGTT 
AGCGAATGCA 
GGACTAGGGC 

V.jH 1 ijs^ A. i. A 

TTCATGCTC& 
A.GAJ^uATGGA\A 
GGATTCTGGC 
TACTAGCTCA 
TAATGATCCA 
TGTGTGTTGG 
TGTTGTTTGC 
CCACTGTCCT 
AGGTGTCATT 
GGATTGGGAA 
GTACCT CTCT 

TGCTGAAGAA 
AAATAAAAAA 
TTGATGCCTA 
TTAGAAAGTA 
AAACCTTATC 
TTGAAAAAAA 
C^^-^y G AJ^AtT TG 
AATTCT CGTT 

TGAGTTAGCT 
GCTCGTATGT 
CAGCTATGAC 

TCCCCCGGGG 
TTCAATACAG 
ATTATGATTG 
TTCCTCTGTC 
ATTGCAAGTT 
TGAATGGAGC 
GCGTCTACAG 
TTTTGTCTGC 
TCTTTATTTG 
ACTTATCTCC 
GATCAAATGA 

TTTTTCTGTT 
ACATGAACAG 



1700
1750
1800
1850
1900
1950
2000
2050
2100
2150
2200
2250
2300
2350
2400
2450
2500
2550
2600
2650
2700
2750
2800
2850
2900
2950
3000
3050
3100
3150
3200
3250
3300
3350
3400
3450
3500
3550
3600
3650
3700
3750
3800
3850
3900
3950
4000
4050
4100
4150
4200
4250
4300
4350
4400
4450
4500
4550
4600
4650
4700

tatttctcag tcttttctcc agcaatcctg acggattaga agaactggga 4750
gaaaacactt tgttacccag aattaaaaac taatatttgc tctcccttca 4800
atccaaaatg gacctattga aactaaaatc tgacccaatc ccattaaatt 4850
atttctatgg cgtcaaaggt caaftcttttg aagggaacct gtgogtgggt 4900
cccaattcag gctatatatt ccccagggct cagccagtgg atccatgggc 4950
tccatcggxg cagcaagcat ggaattttgt tttgatgtat tcaaggagct 5000

CAAAGTCCAC CATGCCAATG ACAACATGGT CTACTCCCCC TTTGCCATCT 5050 
TGTCAACTCT GGCCATGGTC TTCCTAGGTG CAAAAGACAG CACCAGGACC 5100 
CAGATAAATA AGGTTGTTCA OTTTGATAAA CTTCCAGGAT TCGGAGACAG 5150
TATTGAAGCT CAGTGTGGCA CATCTGTAAA TGTTCACTCT TGACTTAGAG 5200 
ACATACTCAA CCAAATCACC AAACAAAATG ATGCTTATTC GTTCAGCCTT 5250
GCCAGTAGAC TTTATGCTCA AGAGACATAC ACAGTCGTGC CGGAATACTT 5300
GCAATGTGTG AAGGAACTGT ATAGAGGAGG CTTAGAATCC GTCAACTTTC 5350
AAACAGCTGC AGATCAAGCC AGAGGCCTCA TCAATGCCTG GGTAGAAAGT 5400
CAGACAAACG GAATTATCAG AAACATCCTT CAGCCAAGCT CCGTGGATTC 5450
TCAAACTGCA ATGGTCCTGG TTAATGCCAT TGCCTTCAAG GGACTGTGGG 5500 
AGAAAGCATT TAAGGCTGAA GACACGCAAA CAATACCTTT CAGAGTGACT 5550 
GAGCAAGAAA GCAAACCTGT GCAGATGATG TACCAGATTG GTTCATTTAA 5600
AGTGGCATCA ATGGGTTCTG AGAAAATGAA GATCCTGGAG CTTCCATTTG 5650
CCAGTGGAAC AATGAGCATG TTGGTGCTGT TGCCTGATGA TGTCTCAGGC 5700 
CTTGAGGAGC TTGAGAGTAT AATCAGCTTT GAAAAACTGA CTGAATGGAC 5750 
CAGTTCTAGT ATTATGGAAG AGAGGAAGGT CAAAGTGTAC TTACCTCGCA 5800
TGAAGATGGA GGAGAAATAC AACCTCACAT CTCTCTTAAT GGCTATGGGA 5850
ATTAGTGACC TGTTCAGCTC TTCAGCCAAT CTGTCTGGGA TCTCCTCAGT 5900
AGGGAGCCTG AAGATATCTC AAGCTGTCCA TGCAGCACAT GCAGAAATCA 5950
ATGAAGCGGG CAGAGATGTG GTAGGCTCAG CAGAGGCTGG AGTGGATGCT 6000
ACTGAAGAAT TTAGGGCTGA CCATCGATTC CTCTTCTGTG TCAAGCACAT 6050 
CGAAACCAAC GCCATTCTCC TCTTTGGCAG ATGTGTTTCT CCGCGGCCAG 6100 
CAGATGACGC AGGAGCAGAT GACGCACCAG CAGATGACGC ACCAGCAGAT 6150 
GACGCACCAG CAGATGACGC ACCAGCAGAT GACGCAACAA CATGTATCCT 6200 
GAAAGGCTCT TGTGGCTGGA TCGGCCTGCT GGATGACGAT GACAAATTTG 6250
TGAACGAAGA CCTGTGCGGC TCACACCTGG TGGAAGCTCT CTACCTAGTG 6300
TGCGGGGAAC GAGGCTTCTT CTACACACCC AAGACCCGCC GGGAGGCAGA 6350
GGAGGTGCAG GTGGGGCAGG TGGAGCTGGG CGGGGGCCCT GGTGCAGGCA 6400
GCCTGCAGCC CTTGGCCCTG GAGGGGTCCC TGCAGAAGCG TGGCATTGTG 6450
GAACAATGCT GTACCAGCAT CTGCTCCCTC TACCAGCTGG AGAACTACTG 6500 
CAACTAGGGC GCCTGGATCC AGATCACTTC TGGCTAATAA AAGATCAGAG 6550 
CTCTAGAGAT CTGTGTGTTG GTTTTTTGTG GATCTGCTGT GCCTTCTAGT 6600
TGCCAGCCAT CTGTTGTTTG CCCCTCCCCC GTGCCTTCCT TGACCCTGGA 6650
AGGTGCCACT CGCACTGTCC TTTCCTAATA AAATGAGGAA ATTGCATCGC 6700
ATTGTCTGAG TAGGTGTCAT TCTATTCTGG GGGGTGGGGT GGGGCAGCAC 6750 
AGCAAGGGGG AGGATTGGGA AGACAATAGC AGGCATGCTG GGGATGCGGT 6800
GGGCTCTATG GGTACCTCTC TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC 6850
TCTCTCGGTA CCTCTCTCGA GGGGGGGCCC GGTACCCAAT TCGCCCTATA 6900
GTGAGTCGTA TTACGCGCGC TCACTGGCCG TCGTTTTACA ACGTCGTGAC 6950
TGGGAAAACC CTGGCGTTAC CCAACTTAAT CGCCTTGCAG CACATCGCCC 7000
TTTCGCCAGG TGGGGTAATA GCGAAGAGGC CCGCACCGAT CGCCCTTCCC 7050 
AACAGTTGCG CAGCCTGAAT GGCGAATGGA AATTGTAAGC GTTAATATTT 7100 
TGTTAAAATT GGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA 7150
TAGGCCGAAA TCGGCAAAAT CCCTTATAAA TCAAAAGAAT AGACCGAGAT 7200 
AGGGTTGAGT GTTGTTCCAG TTTGGAACAA GAGTCCACTA TTAAAGAACG 7250 
TGGACTCCAA CGTCAAAGGG CGAAAAACCG TCTATGAGGG CGATGGCCCA 7300
CTACTCCGGG ATCATATGAC AAGATGTGTA TCCACCTTAA CTTAATGATT 7350 
TTTACCAAAA TCATTAGGGG ATTCATCAGT GCTCAGGGTC AACGAGAATT 7400
AACATTCCGT CAGGAAAGCT TATGATGATG ATGTGCTTAA AAACTTACTC 7450 
AATGGCTGGT TATGCATATC GCAATACATG CGAAAAACCT AAAAGAGCTT 7500 
GCCGATAAAA AAGGCCAATT TATTGCTATT TACCGCGGCT TTTTATTGAG 7550
CTTGAAAGAT AAATAAAATA GATAGGTTTT ATTTGAAGCT AAATCTTCTT 7600 
TATGGTAAAA AATGCCCTCT TGGGTTATCA AGAGGGTCAT TATATTTCGC 7650 
GGAATAACAT CATTTGGTGA CGAAATAACT AAGCACTTGT CTCGTGTTTA 7700 
CTCCCCTGAG CTTGAGGGGT TAACATGAAG GTCATCGATA GCAGGATAAT 7750
GTAGTGAATG
GTTGCATTGG 
ACTCITTGTT 
CA'u* GAG GAkG G 
ATAAACGCTA 
TACTGCCGTT 
CTTGGAATAC 
ATCATGCTAT 
TGCGGGAATT 
TGCCATCGTT 
TTTAGTGAGG 
CCTGTGTGAA 
AAGCATAAAG 
TAATTGCGTT 
GAGCTGCATT 
TGGGCGGTGT 
GGCTGCGGCG 
AC AG A. AT C AG 
AAAGGCCAGG 
TCCGCCGCGG 
CGAAACCCGA 

CCTTTCTCCC 
TATCTCAGTT 
ACCCCCCGTT 
A.GT CG AA«GGG 
AAGAGGATTA 
GTGGTGGCCT 
CTGTGCTGAA 
GGCAAACAAA 
GATTACGCGC 
CGGGGTCTGA 
ATG&GATTAT 
AAGTTTTAAA 
ACCAATGCTT 
TGAtCCATAG 
GGGCTTACCA 

CG GAG AAGTG 
TTGCCGGGAA 
TTGTTGCCAT 
GCTTCATTCA 
CATGTTGTGC 
GAAGTAAGTT 
AATTCTCTTA 
GTACTCAACC 
CTTGCCCGGC 
AAAGTGCTCA 
CTTACCGCTG 
GATCTTCAGC 
GGAAGGCAAA 
AAT&CTCAT& 
ATTGTCTCAT 
ATAGGGGTTG 



AACGCTAAAC 
ATT AT A A AT A 
TAAGGCTCAC 
TGGATAGACA 



CAATAATCCA 
AG^AG G-AiAtAG Ai 
CAATAATCCC 
TGACTCCCTG 
AAAACAGGGA 



AAAAGGGAAA 

TTTTCGCCCC 

TGAGTGTAAA 

TCCATCCAAA 

TGGCGTATCA 

TTAAATAAAG 

GTTAATTGCG 

ATTGTTATCC 

TGTAAAGCCt 

GCGCTCAC'TG 

AATGAATCGG 

TCCGCTTCCT 

AGCGGTATCA 

GGGATAACGG 

AACCGTAAAA 

TGACGAGCAT 

CAGGACTATA 

TCTCCTGTTC 

TTCGGGAAGC 

GGGTGTAGGT 

CAGCCCGACC 

GGTAAGAGAG 

GCAGAGCGAG 

AACTACGGCT 

GCCAGTTAGC 

CCACCGCTGG 

AGAAAAAAAG 

CGCTCAGTGG 

G ^kJ^^^cA^f^CSCS.^^ i 

TCAATCTAAA 

A&TCAGTGAG 

TTGCCTGACT 

TCTGGCCCCA 

AGATTTATCA 

GTCCTGCAAC 

GCTAGAGTAA 

TGCTACAGGC 

GCTCCGGTTC 

AAAAAAGCGG 

GG C C G C AGTG 

CTGTCATGCC 

AAGTCATTCT 

GTCAATACGG 

TCATTGGAAA 

TTGAGATCCA 

ATCTTTTACT 

ATGCCGCAAA 

CTCTTCCXTT 

GAGCGGATAC 

CGCGCACATT 



ATTTAGTGGC 
AG AG G AAG A G 
ACGATTTTCG 
ATTGCGCTGA 
TGATGTAT AG 
CGCTTGGCGT 



AATCCAGCCA 
GTAATGGGCC 
TGTAAAGCAC 
TAATGCAGGT 
AAACTAACCA 
TCTGCAATAA 
TATTCTTCCT 
CCGCTAATGA 
GTAAATAGCA 
A&AATAAATA 
GGAATTCAGC 
AATCATGGTC 
GGAGAGAAGA 
ATGAGTGAGC 
CCCGCTTTCC AGTCGGGAAA 

GGGAGAGGCG 
CTCGCTGCGG 
AGGCGGTAAT 
ATGTGAGGAA 
GGTGGCGTTT 
GACGCTCAAG 
GCG TTTCCCC 
GCTTACCGGA 
GTCATAGCTC 
AAGCTGGGGT 
ATCCGGTAAC 
CACTGGGAGC 
GGTGCTACAG 
GACAGTATTT 
GAGTTGGTAG 
TTTTTTGTTT 
AGATCCTTTG 
CACGTTAAGG 
ATCCTTTTAA 
GTAAACTTGG 
CAGCGATCTG 
TAGATAACTA 
GATAGCGCGA 
AGCCAGCCGG 
TCGATGCAGT 
AGTTAATAGT 
CACGCTGGTC 
AGGCGAGTTA 
CGGTGGTCGG 
TGGTTATGGC 
TGGTTTTCTG 
TATGCGGCGA 
CGCCACATAG 
GGGCGAAAAC 

TTTCTGGGTG 
AGGGCGACAC 
TTGAAGCATT 
GTATTTAGAA 
TCCCCGAAAA QTGCCAC 



GGCTGACTGA 
GCTGACTCAA 

■ "JV"** 0 * <**x y?*>! 4^*!* 

AAGATAGGAG 
CGACCCTGCC 
GTGGGGGTTT 
CGTTCGCTCC 

GACTTATCGC 
GTATGTAGGC 
ACACTAGAAG 
TTCGGAAAAA 



GATCTCAAGA 



CTTC AC CT AG 
QTATATATGA 
GGACGTATCT 
G GO CG^.r CGV.rG 
GTGCTGCAAT 
GG AkA°X*,AAAvG G 



GTAGTTCGCC 



TTATGACTGA 
ATCCGTAAGA 
GAGAATAGTG 
GATAATACCG 
ACGTTCI V TCG 



TCCCAAATTG 7800
AATAACAGCG 7850
CTTGCTGATG 7900
AAAGCGATGC 7950
ACCTTGAGAT 8000
ATCCGAGCAG 8050
GCCACAAAGG 8100
AAAGCCAACC 8150
CCCAGACCGT 8200
ATCAACAAAA 8250
TTTTGTTCGG 8300
ATAGCTGTTT 8350
TACGAGCCGG 8400
TAACTCACAT 8450
GCTGTCGTGC 8500
GTTTGCGTAT 8550
TCGGTGGTTC 8600
ACGGTTATGC 8650
AAGGGGAGCA 8700
TTCCATAGGG 8750
TCAGAGGTGG 8800
CTGGAAGCTG 8850
TAGCTGTCCG 8900
ACGCTGTAGG 8950
GTGTGCACGA 9000
TATCGTCTTG 9050
AGGGACTGGT 9100
AGTTCTTGAA 9150
GGTATCTGCG 9200
CTCTTGATCC 9250
GCAAGCAGCA 9300
ATCTTTTCTA 9350
GATTTTGGTC 9400
ATTAAMATG 9450
TCTGACAGTT 9500
TCTATTTCGT 9550
CGATACGGGA 9600
GACCCACGCT 9650
AAGGGGCGAG 9700
CTATTAATTG 9750
TTGCGGAACG 9800
GTTTGGTATG 9850
CATGATCCCC 9900
ATCGTTGTCA 9950
AGCACTGCAT 10000
TGACTGGTGA 10050
CCGAGTTGCT 10100
CAGAAGTTTA 10150
TCTCAAGGAT 10200
GCACGGAACT 10250
AGCAAAAAGA 10300
GGAAATGTTG 10350
TATCAGGGTT 10400
AAATAAACAA 10450

10487 



SEQ ID NO: 33 (conalbumin polyA)

tctgccatt9 ctgcttcctc tgcccttcct cgtcactctg ^atgtggctt cttcgctact 
gce&e&gcaa gaaafcaasat ctcasicatct aaatgggttt cctgaggctt tfeeaagagtc 
gttaagcaca ttccttcccc a^eacccctt gctgcaggcc agtgccaggc accaackfcgg
ctactgcfcgc ccatgagaga aatccagfctc aatattttcc aaagcaaaat ggatfcacata
tgccctagat cctgattaac aggcgfcttgt attafccfc&gt gcfcfctcgctt cacccagatt 
a t cccs t^*gc c£ccc 



SEQ ID NO: 34 (exemplary antibody light chain sequence)



1 gagctcgtga tgacccagae tccatcctcc CtQtCCQCCt ^t^^ffn ^^fl "C* ^rac
61 atcagttgca gggcaaatca ggacatcagc a^fctatiitsa actggtatca gcagaaacca
121 gatggaactg ttaaactcct gatctaccac acatcaagat tacactcagg ggtcccatea
181 aggttcagkg gcagtgggtc tggaacagat tattc^ctca ccattagcaa cctggagcaa
241 gaagattttg ccacttactt ttgecaacag ggtaatacgc ttccgtggac gttcggtgga
301 ggcaccaacc tggaaaccaa acgggctgat gctgcaccaa ct.gta£ccat cttcccacca
361 tceagtgagc agttaaeacc fcgga9gtgcc tcagtcgtgt gcttcttgaa caaettctac
421 cccaaagaca tcaatgtcaa gtggaagatt gafcggcagtg aacgacaaaa tggcgccctg
481 aacagttgga ctgatcagga cagcaaagac agcacccaca gcacgagcag caccctcacg
541 tCgaccaagg acgagtatga acgacataac agctatacct gtgaggccac t^cacaagaoa
601 tcaacttcac ccattcstcaa gagcttcaac aggaatgagc gttaa





SEQ ID NO: 35 (exemplary antibody heavy chain sequence) 



1 cfccgagtcag gacctggcct ggcggcgccc tcacagaacc tgtccatcac ttgcactgtc
61 tetgggtttc cattaaccag ctstggtgta cactgggttc gccagcctcc aggaaagggt
121 ctggaatggc tgggagtaat atggactggt agaagcacaa cttataattc ggctctcatg
181 tccagactga gcatcagcaa agacaactcc aagagccaag ttttcttaaa aatgaacagt
241 ctgcaaactg atgacacagc Vs-atttaotzao tgtggcagag ggggtctgat taegtccttt
301 gctatggact actggggtca aggaacctca gtcaccgtct ccfccagccaa aac^scsccc
361 ccafcctgtct atccactggc ccctggatct gctgcccaaa ctaactccat ggtgraccctg
421 ggatgcctgg tcaagggcta ttfcccctgag ccagtgacag tgacccggaa ctctggatec
481 ctgtccagcg gtgtgcacac cttcccagct gtcctgcagt etgacctcta cactctgagc
541 agctcagtga cfcgtcccctc cagcacctgg cccagcgaga ccgtcacctg caacgttgcc
601 cacccggcca gcagcaccaa 99£99&£aag aaaattgtgc ccagggattg taetagt



SEQ ID NO: 36 (pTnMCS)

1 ctgacgcgcc eegtsgegge geattaageg eggcgggtgt ggtgg££acg cgcagegtga 
61 cegctaeact tgecagegee ctagegcccg ctccfcfctcgc ttcct£ccce tcccttctcg 

121 ccaegctcgc eggcatcaga ttggetastg gcc&ttgcat acgtfcgtate catatcacaa 

181 tatgtaeact tafc&ttgget eatgtcca&c attaccgeca tgttgaeatt gattattgac 

241 tagttattaa tagfcaatcaa ttacggggtc a^tagfcfccat ageccatata tggagttccg 

301 cgttacataa ettaeggtaa atggcccgce tggctgaeeg cceaacgace cecgcccatfc

361 gaegtcaata atgacgtatg ttcecatagt aacgccaatca gggacttfcce attgaegtca 

421 atgggcggag tatttaeggt aaaetgecca cfctggcagta catcaagtgt ateatatgee 

481 a«>gtacgccc cctatrgscg tcaatgaegg taaatggccc gectg^catt atgcccagta 

541 catgacctca tgggaefctte ctactcggca ge:acatctac gtattagtca tegctattae 

601 cacggtgatg cggttctggc agtacatcaa tgggcgtgga tagcggtttg actc&cgggg 

661 atttccaagt etccacccca ttgacgtcaa tgggagtctg ttttggcacc aaaaceaaeg 

721 ggactttcca aaatgtcgta acaactccgc eeeatfcgacg caaaegggcg gtaggcgtgt 

781 aeggegggag gtctatacaa gesgagcteg cccagtgaac c^tcagatcg cccggagacg

841 ccatccacge tgfctttgacc teoatagaag acaccgggac cgatecagcc tecgcggceg
901 ggaaeggtgc attggaac^c ggatfcceccg tgccasgagt gacgtaagta ccgectatag
961 actetatagg cacscccctt tggctcttat gcatgcfcata ctgtttfcfcgg ctfcggggccfc
1021 ataeaeccec gcttccfetat getataggt*g atggtatagc ttagectata ggtgtgggtt
1081 iittgaccact attgaecact cceetattgg tgacgatacct ttecatt:aet aatccataac
1141 ^tggctcttt: gccacaacta tctctattgg cfcatatgcca atactctgtc cttcagagac
1201 tgacacggac tctgtatctfc fca^a^gatgg ggtcccafcfct attattfcaca astccacata
1261 vacaacaacg ccgfccccccg tgcccgcagc ttttattaaa catagcgtgg gateteeacg
1321 egaatctegg gtacgtgttc c^acatggg ctetcetecg gtagcggcgg agcttccaea
1381 ncegagcect gg £ ccc a t gc c t ccagcgg c tcatggtcgc tcggcagctc ettgetccfca
1441 icagtggagg ceagactfcag gcMagcaca atgeccacca ccaccagtgt gccgcaeaag
1501 < ? ccgtggcgg tagggtatgt gtctgaaaat gagcgtggag at.tgggcr.cg cacggctgac
1561 gcagatggaa gactx&aggc agcggeagaa gaagatgcag gcagctgagt tgtfcgfcattc 
1621 tgataagagt cagaggtaac ; : tcccgfc. tgcg gtgctgtfcaa cggtggaggg cagtgcagtc 
1681 t;gagcagt:ac tcgttgctgc cgcgcgcgce accag&cata atagctgaca gactaaeaga 
ctgttcctfct ccatgggtct t:ttct9cag£ 
ttacacgact ctctttacca atcctgcccc 
acgctggctt gccacgcatfc act fcgac tigs 
aacctesccaa ccaaagcgag aacaaaacat 
aatcgtcacc tccacaaaga gc^actcgct gcatacegtt 
tcgggeaata cgatgcccat cgtaettgtt gactggtctg 

ettatggfc&t tgcgagctte agfccgc&cta eacggtcgtt etgttactefc fctafcgagaaa 
gcgfctcccgc tttcagagca atgttcaaag aaagctcatg sccaatttct agcc«acctt 

taccgagfcaa caccacaccg ctcattgcca gtgatgccgg ctttaaagtg 
aatccgtcga gaagetgggt: tggtactggt taaguegagt aagaggaaaa 
cagacctagg ageggaaaac tggaaaccta tcagcaaccc acatgafcacg 
acccaaagac tttaggctac aagaggctga ctaaaagcaa tccaatctca 



1741 
1801 
1851 
1921 
1931 
2041 
2101 
2161 

» 2 ^> 

2 2 SI 
2341 



caccgtcgga ccatgtgcga actegatatt 
gaattacact taaascgact caacagetta 

cr.cttaccga ac ttggecgt 
gaatcgaccg atigtt.aggt 
ggcatgctag ctttatcfcgt 
atattcgtga gcaasaacga 



aaaactctca 
aaeafccaaac 



2401 
24£1 
2521 
2501 
2641 
2701 
2761 
2821 
2881 
2341 
3 001 
30€1 
3121 
3181 
324X 
3301 
3361 
3421 
3481 
3541 



qc<3&qcafc£ c 

ccatggtata 
gtacaatatg 
s^catetagtc 
*:gccaaa£tc 
actcattgtc 
ccagcaacta 



tafctgtataa 
accaccegtc 
acfctacctgfc 



aagegaatgc agattgaaga 
ctaegecata gccgaacgag 
i.tgcttcaac taacat.gttg 
caettccagg ctaacacagt; 
gaagtctcge 
c tact age fee aaaacttatt 
tetagagega tccgggafcct 
ctttaaaaat aaaaaaeaat 



atctcgctct aaaggccgaa aaaaccagcg ctcgacacgg 
acctaaaatc tactcagegt eggcaaagga gccavgggtt 
tga&afctcga aeacceaaac aacttgttaa tatctattcg 
aacctfcecga gacttgaaaa gccctgccta cggactag9c 
cagctcagag cgttttgata teatgetget aatcgccctg 
gettgegsgc gtfccafcgcfcc agaaacaagg ttgg^acaag 
cagaaatcga aaegtactet caacagtccg ctta^gcatg 
ggcattctgg ctacacaata acaagg^aag acttacccgt ggctgcaacc 

cacaeatggt: tacgettegg ggaaattatg aggggatege 
egggaaaage «r;tggr.gacc aaaggtgcct cttatcatca 
taetcagtgc ctgttataag cagcaatcaa ttatgattga 
tgcct:acatc a caa caaaaa c t g a £ 1. 1 aa c aaatggfcfcgg tetgecttag aaagtatact 
tgaacattat c.r.tgatsata ttattgataa taattaaaaac cctaccccta cccaagaagt 
gatgectate attggttgga aegaacttga aaaaaatcag cct:r,gaat:ac attactggta 
aggtaaaege catngtcagc aaattgatcc aaaagaacca aefctaaaget ttcctgaegg 
aatgttaat t ctcgttgacc ctgagcactg atgaatccec t:aatgatttt ggtaaaaatc 
atfcaagttaa ggtggataca caccttgtca tatgatcceg gtaatigtgag ttagctcact 
cattaggcac cccaggctt^ acacttr.atg cttccggctc gtatgttgtg tggaattgtg 
ageggataac aattccacac aggaaacagc tatcsaccatg attacgecaa gegegcaatt 
3 601 aaccctcact aaagggaaca aaa^cfcggag ctccaccgcg gtggcggccg crctagaact 
3661 agtggatccc ccgggctgca ggaactcgac. atcaagctta tcgataccgc tgacrctcgag 
3 721 ggggggcccg g t ac cca a 1 1. cgecc t a tag tgagtegta-c tacgcgcgct cactggccgf. 

cgttttacaa cgtcgtgact: gggaaaaccc tggcgctacc caacttaatc gccttgcagc 
3S41 ^.catcccccct ttcgccagct ggegtaatag egaagaggee cgcaccgatc gcccttccca 
3«01 acagttgege agectgaatg gcgaatggaa attgtaagcg ttaatatttt gttaaaattc 
3«Si gcgttaaatfc tttgttaaat cagcfccatfcfc tttaaccaat aggecgaaas: eggcaaaate 
4 021 ccttataaac caaaagaaca gaccgagata gggttgagtg ttgfctccagt ttggaacaag 



4 081 ^.gtccactac 
4141 cfatggcccac 



taaagaacgt 
tactceggga 



.201 



ggactccaac gtcaaagggc gaaaaaccgt ctatcagggc 

tcataegaca agatgtgtat ccaccttaac t;taaegact.£ 

t.taccaaaat cafccagggga ttcafccagtg etcagggfcea acgagaatr.a acattccgtc 

42S1 aggaaagctt: attgatgatga tg"cgcctaaa aacttact:ca atggctggtt atgeatateg 

4321 caatacatgc gaaaaaccta aaagagefctg ccgataaaaa aggceaatxt afctgctattt 

4301 ^.ccgcggctt tttafctgagc ttgaaagata aataaaatag ataggtetta tctgaagcta 

4441 aatcttcttt ategtaaaaa afege^ctctt gggttatcaa gagggtcatt afcatttcgcg 

4 SOI gaataacatc atttggtgac gaaataacca agcacttgtc tccr.gttt.ac tcccctgagc 

4561 tttgaggggtt aacatgaagg tcatcgat.ag caggataata atacagtaaa acgctaaacc 

4621 aataatccaa atccagccat cccaaattgg tagt:gaat:ga ttataaataa cagcaaacag 

4S61 taatgggcca ataacacegg ttgcattggt aaggctcaec aataatccct gtaaagcacc 

ggatagacafc cactccccgt aatgcaggfca aagcgat.ccc 

aaacagggaa a a c t a a c c a a ccttcagafca t.aaacgctaa 



4741 ctgetgatga ctctctgfett 
4^01 accaccagcc aataaaafcta 



4861 aaaggcaaat gcactactat ctgcaataaa tccgagcagt actgccgctt tttcgcccat
4921 ttagtggcxa cscctfccetge cacaaaggefc tggaatacfcg agtgfcaaaag accaagaccc
4981 gtaatgaaaa gccaaccatc atgctattca teate&cgat tctctgtaata gcaccacacc 
5041 gtgctggakt; ggccatc&at gcgctgaaat satsatcaac aaafcggcatc gctaaataag 
5101 '.gatgcatac cgatcagett fccgttccctt tagcgagggt taatfcgegcg ctfcggcgfc&a 
5161 ncatggtcat agetgtfcfcce tgtgfcgaaat tgtcatecgc tcacaafctcc acaeaacafca 
5221 cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagt^cta acccacatta 
5281 atfcgcgttge getcaecgcc cgctttccag tcgggaaacc tgtcgcgcca gctgcattaa
5341 t;gaatcggce aacgcgeggg gagaggcggt ctgcgtattg ggcgctcctfcc cgcttcctcg 
5401 ctcactgaet cgctgcgefcc ggtegfctcgg etgcggcgag cggtatcagc tcaetcaaag
5461 gcggta&fcac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgageaaaa 
5521 ggccagcaaa aggecaggaa ccgtaaaaag gccgcgfctgc tggcgtfcttt ccataggctc 
5581 cgcccccctg acgagcafcca eaaaaatcga cgcfccasgte agaggtggcg a&acecgaca 
5641 ggacfcaeaaa gataccaggc gtttccccct ggaagctcce tegtgegcttc tcctgttccg 
5701 accctgccgc tfcaccggata cctgcccgcc tttctccctt cgggaagcgc ggcgctxtct
5761 catagctcac gctgtaggta tcteagtfccg gtgtaggtcg ttcgctccaa gctgggcfcgt 
5821 gtgcacg&ac cccccgttca gcecgaeege tgcgcctxtat ccggtaacta fccgcctfcgag 
5881 tecaaccegg taagacacga cttatcgcca ctggc&gcag ccaetggtaa caggattagc 
5941 agagegaggt: atgtaggcgg tgecacagag ttcfctgaagt ggtggcctaa ctacggctac 
6001 iicfcagaagga cagtatttgg fcatctgegec etgetgaagc cagttaecfct cggaaaaaga
6061 gctggtagct ctfcgatccgg caaacaaacc acegctggta gcggtggfctt tttfcgfcttgc 
6121 aagcagcaga ttacgcgcag aaaaaaagga tcfccaagaag atcetctgat cttctctacg 
6181 gggtcxgacg ctcagtggaa cgaaaactca cgfctaaggga ttttggteat gagattatca 
6241 aaaaggatet fceacctagat cetxttaaat taaaaatgaa gttttaaatc aatctaaagfc 
6301 atatatgagt aaacttggce tgacagttac caatgcttaa teagfcgagge accta'cctca 
6361 gcgacctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgfcgta gaeaactacg 
6421 * tacgggagg getfcaccafcc tggccccagt gctgcaatga taccgcgaga cccacgctca 
6481 ccggctccag atttaccagc aataaaccag ccagccggaa gggccgagcg cagaagtggt
6541 cctgcaacfct tatccgccte catccagtct attaattgtt gcegggaagc fcagagtaagfc 
6601 agtccgccag tfcaatagttt gcgcaacgtfc gttgccattg ccacaggcat cgfcggfcgfcca 
6661 egctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca 
6721 cgatceccca tgttgtgc&a aaaagcggtt agctccttcg gfccctcegat cgttgtcaga 
6781 agtaagttgg ccgcagtgtt atcactcacg gttatggcag cactgcataa tcctcttacfc 
6841 gtca*cgccat ccgtaagatg cttfctctgcg actggtgagt actcaaccaa gtcattctga 
6901 gaacagtgfca tgcggcgacc gagtfcgctct tgcceggcgt caatacggga taatacegcg 
6961 ceaeatagca gaactttaaa agtgcccacc aCtggaaaac gtccttcggg gcgaaaaccc
7021 ccaaggatct taccgctgtt gagatccagc tcgatgtaac ccactcgtgc acccaactga 
7081 tcfctcagcat cr.tttacttt caccagcgct cctgggtgag caaaaacagg aaggcaaaat 
7141 gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt 
7201 caatattatt gaagcattta tcagggttat tg^ctcatcga gcggatacat attfcgaatgt 
7261 atttagaaaa ataaacaaat ^gggg^tccg cgcacatfctc cccgaaaagt gccac 

SEQ ID NO: 37 (chicken ovalbumin enhancer)

cc 999^^9^ gaaaaatgcc aggtggacta tgaactcaca tccaaaggag 

cttgacctga taccfcgattt tcttcaaact ggggaaacaa cacaatccca caaaacagct 

cagagagaaa ccatcactga tggctacagc accaaggtat gcaatggcaa tcc&tfccgac 

attcatctge gacctgagca aaatgattba tctctccatg aatggfctget tctttccctc 

atgaa^aggc aatttccaca ctcacaatat gcaacaaaga caaacagaga acaactaacg 

tgctc^ttcc taatgtcaaa attgtagtgg caaagaggag aacaaaatct caagttctga 

gtaggtttta gcgattggat aagaggcttt gacccgtgag ctcacctgga cttcatatcc 

ttttggataa aaagtgcttt tataactttc aggtctccga gtctttattc atgagactgt 

tggtttaggg acagacccac aatgaaatgc ctggcatagg aaagggcagc agagccfctag 

ctgacctttt cttgggacaa gcattgtcaa acaatgtgtg acaaaactat ttgtactgct 

ttgcacagct gtgctgggca gggcaatcca ttgccaccta tcccaggtaa ccttccaact 

gcaagaagat tgttgcttac tctctcfcaga 

SEQ ID NO: 38 (5' untranslated region)

GTGGATCAACATACAGCTAGA^GCTGTATTGCCTt^AGCACTCAAGCTCAAJ^ 
ACC 



SEQ ID NO:39 (putative cap site) 

AGATACAGCTAG AAAGCTGTAT TGCCTTTAGC ACTCAAGCTC AAAAGACAAC TCAGAGTTCA

SEQ ID NO: 40 (fragment of ovalbumin promoter - chicken)

GAGCTCAQAAT GGTTTCTTTA CTGTTTQTCA ATTCTATTAT TTCAATACAG



AACAATAGCT TCTATAACTG A^TATATTT GCTATTGTAT ATTATGATTG
TCCCTCGAAC CATGAACACT CCTCCAGCTG AATTTCACAA TTCCTCTGTC
ATCTGCCAGG CCATTAAGTT ATTCATGGAA GATCTTTGAG GAACACTGCA 
AGTTCATATC ATAAACACAT TTCAJUVTFGA GTATTGTTTT GCATTGTATG 
GAGCT&TGTT TTGCTGTATC CTCAGAAA&A AAGTTTGTT& TAAAGCATTC 
AC&CCCATAA AAAGATAGAT TT&A&TATTC CAGCTATAGG AAAGAAAGTG 
CGTCTGCTCT TCACTCTAGT CTCAGTTGGC TCCTTCACAT GC&TGCTTCT 
TTATTTCTCC TATTTTGTCA AGAAAATAAT AGGTCACGTC TTGTTCTCAC 
TTATGTCCTG CCTAGCATGG CTCAGATGCA CGTTGTAGAT ACAAG&AGGA 
TC&AATGAAA CAGACTTCTG GTCTGTTACT ACAACCATAG TAATAAGCAC 
ACTAACTAAT AATTGCTAAT TATGTTTTCC ATCTCTAAGG TTCCCACATT 
TTTCTGTTTT CTTAAAGATC CCATT&TCTG GTTGTAACTG AAGCTCAATG 
GAAC&TGAQC AATATTTCCC AGTCTTGTCT CCCATCCAAC AGTCCTGATO 
GATTAGCAGA ACAGGCAGAA AACACATTGT TACCCAGAAT TAAAAAOTAA 
TATTTGCTCT CCATTCAAT C CAAAATGGAC CTATTG&&AC TAAAATCTAA 
CCCAATCCCA TTAAATGATT TCT&TGGCGT CAAAGGTCAA ACTTCTGAAG 
GGAACCTGTG GGTGGGTChC AATTC&GGCT ATATATTCCC CAGGGCTCAG 

c ;• 

SEQ ID NO: 41 (pTnMCS (CMV-CHOVg-ent-ProInsulin-synPA))
1 ctgacgcgcc ctgfcagcgge gcattaagcg cggcgggtgt ggtggttacg egeagegcga 
61 ccgctacacs: tgccagcgcc ctagcgcceg ctccttfccgc tttcfctccct tcctfctctcg 
121 ccacQfctcgc cggcateaga t£ggctatt;g gccatfcgcas: acgttgtatc catatcataa 
181 tat^cacstt fcatattggct oatgtccaac atfcaccgcca sgtfcgacatfc gafcfcafctgac 
241 t.agttattaa fc&gtaatcaa fcfcacggggtc atfcagttcat agcceat&ca szggagfctccg 
301 cgttacataa cttacggtaa atggcccgec tggcfcgaceg cccaacgacc cccgcccatfc 
361 gacgceaata atgacgt&tg ttcecatagt aacgccaata gggacttxcc attgaegtca 
421 atgggfcggag £a£fctacggc aaactgccca cttggeagta catcaagcgt atcatatgcc 
481 aagtaegcce cctattgaeg tcaafcgacgg fca&afcggeee geecggcafct atgcccagta 
541 catgscctta tgggacttcc cfcaccttggca gtacatcr.ac gfcattagtca tcgctattac
601 catggtgatg cggttttggc agtacatcas tgggcgtgga tagcggietg actcacgggg
661 a££fcccaagfc ctccacccca txgacgtcaa tgggagtttg ttttrggcaec aaaatca&cg 
721 ggactfctcca aaatgtcgta acaactecgc cccatfcgacg caaatgggcg gtaggcgtgt: 
781 a^ggtgggag gfcc^atafcaa gcagageteg fctfcagtgaac cgtcaga^cg cctggagacg
841 ccatQcacgc tgttt tgacc tccacag&ag acacegggae cgatccagcc tccgcggccg 
901 ggaacggtge attggaacgc gg&ttccccg tgccaagagt gacgtaagca ccgcctatag 
961 atrtctafcagg cacacccctt tggctcttat gcatgctafca cfcgtvttttgg cfctggggccc
1021 atacaccccc gcfctccfcfcat getasaggtg atggcatagc txagcetata ggcgtgggs:t 
1081 atfcgaccatt atfcgaccact ccccxatxgg tgacg&tact: ttccattacfc aatceataac 
1141 itggcfccttt gccacaacta tctccatxgg ctatafcgccs afcactctgtc cttcagagac 
1201 * gacacggac tcfcgtatfct.t tacaggacgg ggtcccattt afcfcat ttaca aactcacata 
1261 tacaaeaacg ccg&cccccg tgcccgcagt ttttattaaa catagcgtgg gacctccacg 
1321 cgaatctcgg gfcacgtg^tc cggacstggg ctctcctecg gcagcggcgg agcttccaca 
1381 t:ccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc ctcgctccta
1441 acagtggagg ccagac££ag gcacascaca afcgeceacea ccaccagtgt gccgcacaag 
1501 gccgtggcgg cagggtatgt gtctgaaaat gagcgr.ggag attgggctcg cacggctgac 
1561 gcagatggaa gacttaaggc agcggcagaa gaaga?:gcag gcagccgagt tgr. tgtattc
1621 tgacaagagc cagaggtaac tcccgfctgcg gtgctgtcaa cggtggaggg cagtgtagcc
1681 tgagcagtac fccgtcgctgc cgcgcgcgcc accagacana atagctgaca gactaacaga 
1741 ctgttccctt ccatgggcct tttctgcagt caccgtcgga ccatgtgcga acccgatatt.
1801 ttacacgact ctccttacca attctgcccc gaatcacact caaaacgaet caacagccta
1861 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt
1921 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 
1981 aaccgccacc cccacaasga gcgactcgct gtataccgtt ggcacgetag ctccatctgt
2041 tcgggcaata cgatgcccat tgtacttgtt gactggtctg atattcgcga gcaaaaacga 
2101 otcatggtat cgcgagcttc agtcgcacta cacggtcgtt ctgttactct ttatgagaaa 
2161 c:cgt£cccgc cttcagagca at.gttcaaag aaagctcatg accaatttct agccgacctt
2221 cjcgagcattc tacegagtaa caccacaccg ctcattgtca gtgatgccgg ct. ttaaagtg 
2281 i:ca£ggtata aatccg^fcga gaagctgggt tggtactggt taagtcgagt aagaggaaaa
2341 gtacaatac.g cagacctagg agcggaaaac tggaaaccta tcagcaactt acatgatatg 
2401 tcatccagtc actcaaagac tttaggctat aagaggctga ccaaaagcaa tccaatccca
2461 J gccaaattc tattgtataa stctcgctct aaaggccgaa aaaatcagcg ctcgacacgg 
2521 i rtcatcgtc aecacecg&c sectaaaatc cactcagcgt cggcaaagga gccatgggtt
2581 efcagcaacfca acctacctgt Ggaaattcga acacccaaac aacfctgttaa catctattcg 
2641 aagcgaatgc agattgaaga aaccttccga gacttgaaaa gtcctgccta cggactaggc 
2701 c^acgccata gccgaacgag cagctcagag cgttttgata tcacgctcgct aatcgccecg 
2761 ^^gc?:tcaac fcaacatgtcg gcttgcgggc gttcatgctc agaaacaagg ttgggacaag
2821 cacttceagg
2881 gaagtttfcgc 
2941 ctactagctc- 
3001 tctagagcga 

3121 tgcctacatc 
3181 tgaacattafc 
3241 gafcgcctafcc 
3501 aggta&acgc 
3361 aafcgtfcaate 
3421 afcfcaagetaa 
3481 cactaggcae 

3 541 agcggafcaac 
3601 aaccctcacfc 
3661 agfcggatccc 
3721 aafcatgeaca 
3^81 acfcagtfcatfc 
3841 cgcgttaeafc 
3901 ttgaergfceaa 
396% caatgggtgg 

4 021 ccaagfcaegc 
4081 fcacatgaccc 
4141 accatggtga 
4201 ggattfcccaa 
4 2 Si cgggaettfcc 
4 321 gfcaeggtggg 
4 3 81 egceatceae 
4441 egggaaeggfc 
4 SOl agactcfcata 
4 561 ctatacaccc 
4621 teafctgaeea 
4631 acafcggctct 
4741 aefegacacgg 
4 SOI fcataeaacaa 
4 861 egcgaafccec 

4 921 cafcecgagcc 
4981 taacagtgga 
5041 
5101 
5161 
$221 

52 SI gactgfcfcect 
S3 41 cgcagcaage 
540 I tgagaacatc 
5461 tgcaaaagac 
5521 atfceggagac 

5 SOI agacatcctc 
5641 act?: target 
5701 gtat&gagga 
5761 catcaattcc 
5821 efcccgtggafc 
5881 ggsgasaaca 
5941 aagc&aaccfc 
£001 tgagaaaatg 
6051 gfcfcgecfcgat 
6121 gactgaatgg 
6 3. Si eatgaagafcg 
6241 egfcgttfcage 
6301 tcaagctgfce 
6361 ageagaggefc 
6421 cefcctfccfcgt 
6481 ccgeggceag 
6S41 gaegeaceag 
6 SOI fcgtggctgga 
6661 fccacacctgg 
6721 aagaeccgcc 
67S1 ggtgcaggea 
6841 gaacaatget 



ccaacacagt 
ggcatfcetgg 

3333$ fc t>at t 

teegggatee 

aaaaaacaae 

acaacaaaaa 

ctfcgateafca 

attggfctgga 

eatfcgfccagc 

ctcgfcfcgacc 

ggtggacaca 

cccaggcfcCt 

a a 1 1 t cacsc 

aaagggaaca 

c eggg cat c a 

tttatatfcgg 

aatagtaatc 

aacttaeggt 

caatgaegta 

ag fc a t; fc fc a eg 

cccct.at.cga 

tafcgggactt 

tgcggfcttfcg 

gtctccaccc 

eaaaatgtcg 

aggtetafcat 

gcfcgtfctfcga 

gcat fcggaac 

ggcacacecc 

ccgcfctcctt 

tfcatfcgacca 

ttgccacaac 

aetctgtafcfc 

egccgtcccc 

gagtacgtgfc 

etggceecafc 

ggccagac&fc 

ggtagggtxafc 

aagacfcfcaag 

gtcagaggta 

actcgtfcgct 

teccafcgggt 

a fcggaa c efc t 

ttefcactgee 

agcaccagga 

agtafctgaag 

aaecaaafcea 



ac aaggga ag 
tacgcfcefcgg 



aaafcggfctgg 
fcaafcaaaaac 



ctaeacaafca 
e&cacafcggfc 
egggaaaage 
fcaeceagtgc 
ctgatfctaac 
ttattgataa 
atg a ac e tg a 
aaattgafc.ee 
ctgagcacfcg 
cafcctfcgtca 
acactfcfcafcg ctfceeggctc 
aggaaaeage catgaccatg 
aaageeggag ctccaccgcg 



eaacagttcg 
act fcacfcegt 



c fc fc aggca t g 
ggcfcgcaacc 




fcfcfcafccafcca 
tfcatgatfcga 
.aaagfcafcafct 
tccaagaagt 
afceactggta 
tfcectgaegg 



:ca 



cteatgt 
aa 

aaafcggcccg 



acattaccgc 

tcattagttc afcagcecata 
ccfcggctgac 



cacttggcag 



tec tact tgg cagtacatct 

gcagtacaec aacgggegtg 

cafctgaegic aatgggagfcfc 

t&acaaefcce gccccatfcga 

aageagagct cgfcfcfcagfcga 

cefcecataga agacaceggg 

g egga £ tec c eg tgecaaga 

tteggetctt atgeatgeta 

atgetatagg tgatggtata 

ctcecctsfct ggt gacgata 
tacetctact ggctatatgc 
t tc.acagga^ ggggcccca? 
cgtgcccgca gfcfctttatta 

ggctcttctc 
gc t ca t gg £ c 
aggcaeagea caatgcccac 
g-egtctgaaa atgagcgtgg 
geageggcag aagaagatgc 



cagca&ttaa 
tct^ecttag 
cttsteccta 
ecctgaacac 
act t;aaagct; 
taatgattts: 
gtaafcgtgag tcagctcac?: 
gtacgttgtg tggaattgtg 
attacgecaa gegegcaatt 
gtggcggccg ctctagaace 
atacgttgta t.ccafca&cac 

ttgattattg 

cccccgccca 
ecattgaegt 
gtatcafcatg 
5ctatgccca« 
categctatt 



f.agggacttt 
tacafceaagt 
ccgcctggca 



gatageggtt 
tcgfctttggca 



acegtcagat 
accgatccag 





tacfctj.tfcfc.fct 
gcfcfcagccfca 
Gttfcccatfca 

fctatfcactca 
aacatagegt 

cgg^agcg9 c 

gcfccggcagc 
caccaccagt 
agafcfcgggct 
aggcagcfcga 
aacggtggag 



c c aa aa fc c a a 
cggfcaggcgfc 
cgcctggaga 
cctccgcggc 
fcaccgcctafc 
ggct: tggggc 



ctaatccata 
teefctcagag 
caaat fceaca 
gggatctcca 

^{^^ . ■ 



cgcacggcfcg 



gccgcgcgcg ccaccagaca 
cfctttctgca gfccaccgtcg 

afctcaaggag 
catgfccagct 
cacagataaa taaggttgfct 
cfccagtgfcgg cacafccfcgta 
ccaaaccaaa tgatgt 1 1 a t 
acccaafccc t gec 



ggatccatgg 
ctcaaagccc 



gcfcccatcgg 




ggcfctggaac cfcatcaacfct 



t ctcaaactg 
ttfcaaggafcg 
gcgcagatga 
aaga t cc tgg 
gaagtctcag 
accagfctcca 
gaggaaaaat 
tcfcfccagcca 




afceaagcaca 
cagafcgaege 
cagatgaege 

tggaagefcet 



gcctgcagcc 



tcaaacagct 
tggaattafcc 
caafcggfctcfc ggttaafcgcc 
aagacacaca ageaatgeefc 
tgea ccagat tggtt fcat tt 
agcfctccacfc fcgccagtggg 
geefctgagea gefctgagagfc 
atgctatgga agagaggaag 
acaacctcac atetgfcctca 
atctgfcctgg cat etc etc a 

caafcgaagca 
ctctgaagaa 
egccgttctc 
gacgcaccag 
aecagcagat gaegcaacaa 
ggatgacgat gacaaattfcg 
ct acc tagtg tgeggggaac 
ggacctgcag gtggggcagg 
cttggccctg gaggggfcece 
ctgctccctc taccagctgg 



cgctfcfcgata 
aacgfcfccact 
t eg t c cage c 
ttgcagtgfcg 
geagafceaag 
agaaafcgtce 
attgtctfcca 
t fc c ag ag fcga 
agagtggcat 
acaafcgagea 
afcaatcaaet 
afceaaagtgt 
atggctatgg 
gcagagagce 
ggcagagagg 
ttfcagggetg 
fcfccfcfctggca 
cagatgaege 
cafcgtat.ect 
fcga ac caa ca 
gaggefcccfct 
tggagctggg 
fcgc agaagcg 
agaactaetg 



aacttccagg 
ettcaefctag 
fctgeeagtag 
tgaaggaact 
ccagagagct 
tfccagccaag 
aaggactgfcg 
ccgagcaaga 
caafcggecte 
tgttggtgct 
tegaaaaaefc 
aettacefceg 
gcatfcacfcga 
tgaagatatc 
fcSS^agggtc 
ace a t e c a t fc 
gatgtgfc etc 
aceagcagat: 
gaaaggcttcfc 
ectgtgeggc 
ct acacaccc 
egggggcccc 
fcggcattgtg 
caact agggc 



6901 gcctaaaggg cgaattateg cggccgctct agaccaggcg cetggatcca gatcaetfccfc
6961 ggeta&taaa agateagagc tctagagatc tgfcgtgttgg ttttttgfcgg afcctgctgtg
7021 cctfcctagtt gecagccatc fcgttgtttgc ccctcccccg fcgcct£eet£ gaceetggaa
7081 ggfcgecactc ceaetgteet fctcctaafcaa aafcgaggaaa ttgcategca ttgtcfcgagt
7141 aggfcgt:ca£t ctattetggg gggtggggfcg 999C&0caca gcaaggggga ggattgggaa
7201 gaeaatagea ggcatgetgg ggaegcggtg ggctcfcatgg gtacctctcfc ctctctctct
7261 ctctctctct ctctctetct ctetcggfcac ctctctcgag ggggggeceg gtaeccaatt
7321 cgcccta&ag tgagtcgtafc fcacgcgcgcfc eactggccgt; egfctttacaa cgtcgfcgaet
7381 gggaaaaccc tggcgttacc eaactita&te geetcgcagc acatceccct ttcgceagct
7441 ggegtaatag cgaagaggce cgcaccg&fcc geecrtcccca acagfctgcgc agcctgaatg
7501 gegaatggaa attgtaagcg ttaatafcttt gttaaaattc gcgtfcaaatt tfctgttaaat
7561 cagctcattt tttaaecaat aggecgaaat: cggcaaaatc ecfctataaat caaaagaata
7621 gaecgagafca gggcfcgagtg tfcgttecagt tfcggaacaag agtecacfcat taaagaacgt
7681 ggacfcccaac gfccaaagggc gaaaaaccgt ctateagggc gacggcccac tactccggga
7741 tcatatgaca agatgfcgtat ccaecfcfcaac fcfcaatgatfct; fctaccaaaat cattagggga
7801 fcfccate&gtg cfccagggfcca acgagaatta acattecgtc aggaa&gefct atgatgacga
7861 fcgfcgcttaaa aacttactca atggctggtt atgcatatcg caatacatgc gaaaaaccta
7921 aaagagcttg ecgataaaaa aggceaatfct attgctattE accgcgget;t tfctattgagc
7981 ttgaaagata aataaaatag afcaggfcfcfcta tttgaagcta aatcttc&tt atcgtaaaaa
8041 atgcccfcctt gggttateaa gagggtcatt afcatttcgcg gaataaeafce atttggtgac
8101 gaaafcaacfca ageacttgfcc fccctgfcttac tcccctgage tfcgaggggtt aaeatgaagg
8161 fccafccgafcag eaggataafca ataeagfcaaa acgctaaacc aataatecaa atceagccat
8221 cecaaattgg tagtgaatga fctafcaaataa cagca&acag taatgggcca ataacaccgg
8281 ttgcattggt aaggctcacc aat:aatccct gtaaagcace ttgetgatga ctctttgttt
8341 ggatagacat cactccctgt aacgcaggfca aagcgatcce accaceagcc aataaaatta
8401 aaaeagggaa aactaaccaa ecfcfccagata fcaaacgctaa aaaggeaaat geactactat
8461 cfcgeaataaa tcegagcagt actgccgttt tfctcgcccat Gtagtggeta ttcfctcctgc
8521 cacaaaggcfc tggaatactg agfcgfcaaaag accaagaccc gtaatgaaaa gecaaccate
8581 atgetattca fccafccaegat ttcfcgfeaata gcaccacacc gtgctggatt- ggctatcaat
8641 gcgctgaaat aataatcaac aaatggcatc gtfcaaataag tgatgtatac cgatcagctt
8701 t:t:gttccct:t tagttgagggt taattgcgcg ctxggegfcaa tcatggtcat agctgttt:cc
8761 tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcat^aaagtg
8821 fcaaagcctgg ggtgcctaat: ggigtgagcfca actccacatta at.fcgegt.t:gc gctxcactgcc
8881 cgctttccag tcgggaaacc tgtcgtgcca gcfcgcattaa tgaatcggcc aacgegeggg
8941 gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgc?:gcgct:c
9001 ggtcgttcgg ctgcggcgag cggcafccagc teacteaaag gcggtaatac ggttatccac
9061 agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggcca^caaa aggccaggaa
9121 ccgfcaaaaag gccgcgttgc fcggegttttt ccacaggctc cgcccccctxg acgagcatca
9181 caaaaatcga cgctcaagtc agaggfcggcg aaacccgaca ggactataaa gataccaggc
9241 gtttcccccfc ggaagctccc tcgtgcgctc tccfcgtfcccg accctgccgc ttaccggatxa
9301 cctgtccgcc tttcfcccctt cgggaagcgt ggcgctttct eatagctcae gctgtaggta
9361 tctcagttcg gfcgtaggtcg tfccgctccaa gtgcacgaac cccccgttca
9421 gcccgaccgc tgcgccttat ccggtaacta tcgtctfcgag tccaaeccgg taagacacga
9481 cttategcca ctggcagcag ccactggtaa caggatfcagc agagcgaggt atgtaggcgg
9541 fcgctacagag ttetxgaagt ggtggcctaa ctacggctac ac«:agaagga cagtatttgg
9601 tatctgcgct ctgctgaagc cagttacctt eggaaaaaga gttggtagct cttgatccgg
9661 caaacaaacc accgctggta gcggtggttt ttttgttxtgc aagcagcaga s:tacgcgcag
9721 aaaaaaagga tCfccaagaag atcctcttgat cfcfcttctacg gggtcfcgacg ct:cagtggaa
9781 cgaaaacfcca cgfctaaggga ttttggfccafc gagatratca aaaagga&ct tcacctagat
9841 ccttfctaaac taaaaafcgaa gfctttaaatc aatcfcaaagt atatatgagt aaacttggtc
9901 tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc fcatfcfccgtfcc
9961 atccatagtt gcctgactcc ccgtcgtgta gataacfcacg atacgggagg gcttaccatc
10021 fcggccccagt gctgcaatga taccgcgaga cccacgctca ceggcfcccag atttatcagc
10081 aataaaccag ccagccggaa gggccgagcg eagaagtggt cctgcaactt tatccgcctc
10141 cat:ccagt:ct atfcaattgtt; gccgggaagc fcagagtaagt agetcgccag ttaattagttfc
10201 gcgcaacgt:fc gt.tgccafctg ctacaggcat cgcggtgtea cgctcgtcgt tfcggfcafcggc
10261 ttcattcagc ^ccggctccc aacgatcaag gcgagt^taca tgatecccca tgttgfcgcaa
10321 aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagtfcgg ccgcagtgtt;
10381 atcactcatg gtfcatggcag caccgcataa ttctcttact gtcatgccat ccgtaagatg
10441 cfctttctgtg accggtgagt actcaaccaa gccattctga gaatagtgta tgcggcgacc
10501 gagttgctct tgeccggcgt caatacggga taataccgcg ceacatagca gaactttaaa
10561 agtgctcate atfcggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt
10621 gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcageat cttttacttt
10681 caccagcgtt fcctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag
10741 ggcgacaegg aaatgttgaa tactcatact cetcc^tttt caatattatt gaagcattta
10801 tcagggttat tgfcetcatga gcggatacat atttgaatgt atttagaaaa afeaaacaaat
10861 ssS^Sfct^^ cgcacatctc cccgaaaagt gccac






SEQ ID NO: 42 (pTnMOD (CMV-CHOVg-ent-ProInsulin-synPA))

1 ctgacgcgec etgtagcggc gcatfcaagcg cggcgggtgfc 995:935: tacg cgcagcgtga
61 ccgcsacacfc tgccagegcc ctagqpcc^ ctcctctcgc tttcteccct teetttctcg
121 ccacgtccgc cggcstcsga t^g^ctattg gccaetgcat acgfcfcgtatc catatcataa
181 tatgtaeafct tatacfcggct cafcgtce&ac attaecscca tgtcgacatc gattafctgac
241 tagttastaa t&gtaatcaa tcacggggte attagctcat agcceatata tggagtfceeg
301 cgttacataa cttacggtaa atggcecgcc tggetgaecg cccaacgacc ceegeccatt:
361 gacgtcaata atgacgcatg ttcccatagt aacgccaata gggactctcc attgacgtca
421 atgggtggag tattxacggt aaacfcgccca cttggcagta catcaagtgc ateafcatgcc
481 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcetggcatt atgcccagta
541 eatgaeetca tgggaccttc ccaCttggea gtacacctac gtattagtca Kegcfcattac
601 eatggtgatg cggttttggc agtacafccaa tgggcgsgga t&gcggfctcg actcaegggg
661 atttccaagt ctceacccca tfcgacgtcaa tgggagsttg ctfcfcggcaec aaaatcaacg
721 ggactttcca aaatgtegta acaactccgc cccattgaeg eaaatgggeg gtaggcgtgc
781 acggtgggag 3t1ct.at.ataa gcagagctcg tttagtgaac egtc&gateg cctggagacg
841 ccatecacgc tgttfctgace feccatagaag acacegggac egacceagcc toegeggecg
901 ggaacggtge attggaaege ggattececg tgccaagagt gaegtaagta cegcctacag
961 actctatagg cacacccctt tggetcttat geatgetata etgttttfcgg ctcggggcct
1021 atacaccccc getfceefctat. gctat.aggtg atggtafcagc ttagectata gg^gtgggfct
1081 attgaceatt attgaecact. cccct.ar.tgg tgacgatact. ttccattact satcestaac 
1141 atggctcttt gccacaacta t etc cat egg ctatatgeca a t.aetctgt c cct cagagac 
1201 tgacaeggae cetgtatctc tacaggaegg ggtcecattt attatttaca aattcacata 
1261 taeaacaacg ccgtcceece tgccegeagt ttttattaaa catagcgtgg gatefcecaeg 
1321 egaatctegg gtacgtgtte eggaeatggg cccttcceeg gtageggegg aget t ceaca 
1381 tcegagccct gg tcccat ge c* t ceag egg e teatggtegc tcggcagctc ettgetecta 
1441 acagtggagg ccagactfcag gcaeageaca atgeecaeca ecaccagtgt gcegsacaag 
1501 gecgtggcgg cagggtaege gtctgaaaat gagegeggag attgggcteg eaeggctgae 
1561 gcagatggaa gacctaagge ageggcagaa gaagatgcag gcagctgagt tgttgtatcc
1621 tgataagagt cagaggcaac tcccgttgcg gtgctgfctaa cggfcggaggg cagt.gr.agcc 
1681 tgagcagtae tcgttgctgc cgcgcgcgcc accagaeaea acagctgaca gaetaacaga 
1741 ctgttcettt ccatgggcet: tttctgcagt cacegccgga ccacgtgtga acttgatatt 
1801 ttacatgatt ctet&tacca attctgcccc gaattcacaet taaaacgact caacagctta 
1861 acgttggcfct. gccacgcatt acttgaetge: aaaaefcefcea efcettaeega acctggcegt 
1921 aacctgccaa ceaaagc^ag aacaaaaeat aacatcaaac gaatcgaccg attgttaggt
1981 aatcgtcacc tccacaaaga gcgactcgct: geatacegtt ggcatgetag ctttatctgt
2041 cegggcaata egatgeecat cgtacctgtt gactggtccg atactcgtga gcaaaaacga
2101 cctatggtat tgegagctfee agtegcacta caeggtegtt ctgctactct ttatgagaaa
2161 gcgtccccgc ttccagagca atgttcaaag aaagctcatg accaatfctct agccgacctt 
2221 gegagcatte taccgagtaa caccacaccg ctcattgtca gtgatgctgg ctttaaagtg 
2281 ecatggfcata aatccgttga gaagctgggt tggtactggt fcaagtcgagt aagaggaaaa



2341 gt:acaacatg cagacctagg ageggaaaac tggaaaccta tcagcaactt acatgatatg
2401 ccatctagtc actcaaagac tttaggctafc aagaggctga ctaaaagcaa tccaatctca
2461 tgecaaatte cattgtacaa acctcgctct a«iaggccga a aa a a t c ag eg c t eg a c a egg

2521 actcattgtc accacccgfcc acctaaaacc tactcagcgc eggea aagga geca c ggg 1 1
2581 ctagcaacca acctacctgc tgaaattcga acacccaaac aacctgtcaa tafcetafctcg
2641 aagcgaatgc agattgaaga aacctt.ccga gacttgaaaa gtcctgccta eggactagge
2701 ctacgccata gecgaacgag cagcccagag cgttt*:gata tcacgctgct aatcgccctg
2761 atgettcaac caacatgttg gettgeggge gttcatgetc agaaacaagg ttgggacaag
2821 cacttccagg ctaacacagc cagaaatcga aacgtactcc caacagfcccg cttaggcacg
2881 gaagttctgc ggcat.cct.gg ccacacaaca acaagggaag aettactege ggct.gcaacc
2941 ctactagctc aaaatccatc cacacatggt caegecttgg ggaaattatg acaatgatcc
3001 agaccactfcc fcggctaataa aagafceagag ctctagagat ccgcgtg^tg gttcttcgtg
3061 gatctgetgt gecttctagt: tgccagccat ctgttgtttg cccctccccc gtgccttccc
3121 cgaccctgga aggtgecac£ cccactgt.ee tttcctaata aaatgaggaa actgcatcgc
3181 ar.cgt.cr. gag cagg* gecac t.ccattctgg ggggtggggt ggggcagcac agcaaggggg
3241 aggar.tggga agacaatagc aggcatgetg gggargcggt gggctctatg ggtacctcCc
3301 tctctccctc cctcr.ctctc tctctctctc tetcteggta ccctctetcte r.crctctctc
3361 tcf-ctcccfcc tctccctctc teggtaccag gtgetgaaga attgaccegg tgaccaaagg
3421 tgecettfcat catcacttca aaaataaaaa acaafctacCc agtgcccgtc ataagcagca
3481 atr,aat£atg att-gatgect a cat c a c a a c aaaaactgat fctaacaaatg gtfcggfcct.gc
3541 cteagaaagc atatctgaac ar.t.atctr.ga ttacattatt gacaataata aaaaccttat
3601 ccctatccaa gaagtga t.gc cr.atcatr.gg ttggaatgaa cttgaaaaaa attagecttg
3661 aacacaccac eggtaaggta aacgccar.tg tcagcaaact gatccaagag aaccaaccta
3721 aagccr.teck gacggaat.gc t aat t. cc eg c. tgaccctgag caccgatgaa tccccctaatg
3781 atcttggtaa aaatcar.t.aa gtt.aaggr.gg atacaeatct tgtcatatga teceggtaat
3841 gtgagttagc tcacsccatta ggcaccccag gctctacact. ttatgettec ggctcgtatg
3901 ttgtgtggaa ctgtgagcgg ataaea&ttt; cacacaggaa acagctatga ceacgattac
3961 gceaagcgcg eaattaaccc tcactaaagg gaacaaaagc txggagctcca ccgcggtggc
4021 ggecgefceta gaactagtgg atececcggg cafccagatcg gctat tggcc attgcatacg
4081 tfcgtatccat accataatafc gtacattcac attggctcat gfcccaacatx accgccatgt
4141 tgacattgat tattgactag ttatcaatag taateaatta cggggccafct agtfccatagc
4201 ccatatatgg agfctccgegt tacataaetx acggtaaatg geecgcetgg ctgaccgccc
4261 aacg&ecccc geccattgac gtcaafcaatg acgfcatgtfcc ccatagfcaac gccaataggg
4321 aetttccatfc gaegtcaacg ggtggagfcat ttaeggtaaa ctgcccacfcfc ggcagtacat
4381 caagtgtatc acatgccaag tacgccccet attgacgtca atgaeggtaa atggcccgcc
4441 tggcattatg cccagtacafc gacectatgg gactttccta efcfcggc&gta catccacgca
4501 ttagteaecg cfcatfcaccat ggcgatgegg ttttggcagt acafccaatgg gcgtggacag
4561 cggettgact eaeggggatt tccaagtcte caccccafcfcg acgtcaafcgg gagtctgtfct
4621 tggcaccaaa atcaacggga ctttccasaa fcgfccgtaaca actccgcccc attgacgcaa
4681 stgggcggfca ggcgtgtaeg gtgggaggte tatataagca gagctegttt agtgaacegt
4741 cagatcgcct ggagacgcca tccacgctgt. tttgacctce atagaagaea ccgggaccga
4801 tccagccfccc gcggccggga acggtgcatt gga&cgcgga tteccegtgc caagagcgac
4861 gtaagtaccg ecfcatagacfc cfcataggcac acccetttgg ctettacgea tgclatactg
4921 tcfctttggctt: ggggcctafca cacccccgct tccttatgct ataggtgatg gtatagctta
4981 gcctataggfc gtgggfctafct gaccatfcatt gaccactccc ctattggtga egatactttc
5041 cafc£actaa£ ccafcaacafcg gctcttcfcgcc acaaccatcfc ecatfcggcta fcacgccaata
5101 cfccfcgtccfct cagagactga cacggactet gtattttcae aggatgggge. eccatttatt
5161 att-tacaaat fceacafcafcac aacaacgccg tcccccgtgc cegcagtt tz tattaaacat
5221 agcgtgggat ctccacgcga afcctcgggta cgtgtfcccgg acatgggctc fctctecggta
5281 gcggcggagc ttceacafccc gagecefcggt cccatgcc^c cagcggctca tggtcgctcg
5341 gcagctectt: gcfccctaaca gtggaggcca gactfcaggca cagcacaafcg cccaccaeca
5401 ccagtgtgcc gcacaaggcc gtggcggtag ggtatgtgtc txgaaaafcgag cgtggagatx
5461 gggcecgcac ggccgaegca gatggaagac fcta&ggcagc ggcagaagaa gatgcaggca
5521 gcfcgagttgt tgfcatfccfcga caagagfccag aggtaactcc cgt£gcgg£g ctgfctaacgg
5581 tggagggcag tgtagfcctga gcagt&ctcg ttgctgcegc gcgcgccaee agacataata
5641 gc ttgacagac taacagacfcg tfccctttcca tgggtetttt ctgcagtcac cgtegggatc
5701 catgggotcc afccggcgcag caagcaitgga attttgtctt gatgtafcfcca aggagctcaa
5761 agticcaccat gecaatgaga acatcsfceta ctgccccatt gccatcacgt cagctctiagc
5821 catggtafcae ctgggtgeaa aagacagcac caggacacag ataaataagg fctgetcgett
5881 "cgataaactc ccaggattcg gagacagta£ tgaagctcag tgtggcacat ctgtaaacgt
5941 tcactcttca cttagagaca tccecaacca aafccaccaaa ccaaatgatg tfctatfccgtt
6001 cagccttgcc agtagacttt atgetgaaga gagafcaccca atcctgccag aatacctgea
6061 gtg£:gtgaag gaactgtata gaggaggctt ggaacctatc aactttcaaa cagctgeaga
6121 teaagecaga gagctcatca attcctgggc agaaagtcag acaaaeggaa ttateagaaa
6181 tgcccttcag ccaagctccg tggattctca aactgeaatg gttctggtta atgccattgt
6241 cttcaaagga ctgtgggaga aaacattxtaa ggatgaagac acacaagcaa tgcctttcag
6301 agegactgag csagaiaagca aacctgtgca gatgatgtac cagateggtt tatr.tagagt
6361 ggcatcaatg gcttctgaga aaatgaagat cctggsgcct ccafcttgcca gtgggacaat
6421 gascat:gt£g gcgctgttgc ctgatgaagt ct caggccct gagcagcttg agagta?:aas:
6481 caactttgaa aaactgactg aatggaccag ?:tctaatgtt atggaagaga ggaagat:caa
6541 ag^gt.actta cctcgcacga agatggagga aaaatacaac ctcacatcttg fcettaa^gge
6601 tatgggcatt actgacgtgt ttagctctt:c agecaatetg t.ctggcacct cctcagcaga
6661 gagectgaag afcatctcaag ctgtccatgc ageacatgea gaaatcaatg aagcaggcag
6721 agaggtggta gggt:csgcag aggctggagt ggatgetgea agcgtc&ctg aagaatctag
6781 ggctgaccat ccattcctct tctgtatcaa gcacatcgca accaacgccg ttctcttctt
6841 £ggcagacg£ gtfctcccgcg gccagcagac gacgcaccag cagatgaege accagcagat
6901 gacgcaccag cagatgaege aecagcagat gacgcaccag cagatgaege aacaacatgt
6961 atcctgaaag gctctfcgtgg ctggatcggc ctgctggatg acgatgaeaa atttgtgaac
7021 caacascctgt geggctcaca cctggtggaa gctctetacc fcagtgfcgegg ggaacgaggc
7081 ttcttctaca cacccaagac ccgeegggag gcagaggacc tgcaggtggg gcaggtggag
7141 ctgggegggg gccctggtgc aggcagectg cagcccfcfcgg ccctggaggg gtccctgcag
7201 aagcgtggca ttgtggaaca atgctgtacc ageatctget cectctacca gctggagaac
7261 tactgeaact agggegecta aagggegaat catcgcggcc gctctagacc aggegectgg
7321 atccagatca cttctggcta ataaaagatc agagecctag ag&tctgfcgt gttggtcttt
7381 tgcggatctg ctgtgccttc tagtcgecag ccatctgttg ctcttgcccctc ccccgtgcct 
7441 tccttgaccc tggaaggtgc cactcecact gtcctttcct aataaaatga ggaaattgea 
7501 ecgcactgtc tgagtaggtg £cattetatt ctggggggtg gggtggggca gcacagcaag 
7561 ggggaggatt gggaagacaa tagcaggcat gctggggatg cggtgggctc tatgggtacc 
7621 tctcfcctctc tctctctctc tctctctccc tctctctctc ggcacctcte ctcgaggggg 
7681 ggcccggtac ecaattcgcc cfcatagtgag tcgcatcacg cgcgctcact ggccgtcgtt 
7741 cc&caacgtc gtgactxggga aaaccetgge gttacccaac teaategect tgcagcacac 
7801 ccccctttcg ecagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcceaaeag 
7861 ttgcgcagcc tgaatggega atggaaattg taagegttaa tattteg?xta aaattcgcgt 
7921 fcaaatttfcfcg tfcaaatcagc tcatttttta accaafcagge cgaaaccggc aaaacccctt
7981
8101 

8XSX 

J221 
^281 
#341 
8401 

it 521 
S581 
8641 
8701 

88 21 

ssei 

8941 
9001 
9061 
9X21 
<U81 

■f)301 
^361 
9421 
94S1 
V 541 
>601 

9721 
9781 
0841 
9901 
9961 
10021 
10QS1 
10141 
10201 
10261 
10321 
10381 
10441 
10501 
10561 

10741 
10801 
X«*8SX 
lt'<S2l 
I s . 981 
11041 
11101 
11X61 
11221 



afcaaatcaaa 
GaCt 3fc t a S££ 

gc cc ac t a £ t 

C a £3 o& & C CI a t 1 

aagcttatga 
acatgegaaa 
cggcttttta 
ttcttfcateg 
aacateatct 
ggggtcaaca 

a t tCC 

gggecaataa 
tgatgactct 
ccagccaata 

tggctaf; tct 
tgaaaageca 
tggattggct 
gtataccgat 
ggtcafcagct 
ceggsagea t 
cgt tgegetc 
tcggccaacg 
etgactegct 
caatacggfct 
agcaaaaggc 
cccctgacga 
tataaagata 
tgecget tac 
gcccacgctg 
acGjaactrC cc 
acceggtaag acacgactta 
egaggtaegt 
gaaggacagt 
gtageccttg 
agcagattae 
ctgacgctca 
ggafcefctcac 



agaatagace gagatagggt 
gaacgtggac teeaaegtea 
cegggatcat atgacaagat 
&99%S&ttc& teagtgetea 
tgatgatgtg ettaaaaaefc 
aaectaaaag agettgeega 
tegagetttga aagata&ata 
taaaaaafcgc cctcttgggt 
ggtgacgaaa t&actaagca 
tgaaggtcat cgatagcagg 
agecatccca aattggtagt 
caeeggt tgc at tggtaagg 
" eg 1 1 tggat: agaeat cac t 
a aa c t aa a a c agggaa a a c fc 
tactafcetge aataaafcccg 
tectgeeaca aaggcttgga 
aecatcacge tafctcateat 
ateaatgege fcga&afcaata 
c ag c 1 1 1 fc g t t cc c t 1 t ag t 



gfct fccetgtg fcgaaat tgt t 
asiagtgt&aa gcctggggtg 



actgcccgct t 
cgeggggaga 

g cgc t eg g te gtt egg ct gc 
atccacagaa teaggggata 
caggaaecge aaaaaggccg 
geatcacaaa aategaeget 



ccaggcgtfct 
eggaeacetg 



ccccctggaa 
tcegcctttc 
ageteggtgt 
gasegetgeg 



ar.etggtafcc 
atceggcaaa 



tctgtctatt 

eaaetttatc 
cgccagfc taa 
cgtcgtfctgg 
cccccatgtt 
agtcggccgc 
tgccatccgt 
agtgtatgcg 
afcagcagaac 
ggatcetaec 
cagcatcttt 
caaaaaaggg 
at tactgaag 
agaaaaataa 



gtggaacgaa 
etagatcefce 
fct.ggtct.ttac 
tegtccaecc 
accaccfcggc 
ate ag c aa t a 
egcctccatc 



acagagttet 
cgcgctctgc 
eaaaceaccg 



ttaaattaaa 



atagtfcgcct 
cecagfcgetg 
aaeeagccag 
cagtctafcta 



tatggettca fctcagcfcccg 
g t gc a aaa aa 
a tea 



gcgaccgagt 



tactttcacc 
aat&agggcg 
cacttatcag 



tgagcgttgt 

gtgtaticcac 
gggtcaacga 

tactcaatgg 
taaaaaaggc 
aaacagatag 
tatcaagagg 
cctgtctccfc 
ataacaatac 
gaacgafctac 

ccctgtaavg 

a a c c a a c c t 

agcagtactg 

atactgagtg 

cacgat ttcc 

atcaacaaat 

S#g93£taat 

atccgetcac 

cctaatgagt 

gaaacctgtc 

gtattgggcg 

ggcgagcggt 

aegcaggaaa 

cgttgctggc 

caatttcagag 

gccccctcgt 

tecefcteggg 

aggtegtteg 

ccfccaKccgg 

cagcagccac 

tgaagtggtg 

tgaagccagt 

ctggtagegg 

aagaagatcc 

aagggatttt 

aatgaagctt 

gcttaatca3 

g a c c c c c eg v. 

caacgatacc 

ceggaaggge 

attgttgccg 

ccattgctac 

i3' "♦"jr* 



tccagtttgg 
aacegtctat 

gaattaaeat 
cfcggttatgc 
caatttattg 
gttttattfcg 
gtcattatac 
gtttactccc 
ag t aaa a cgc 
aaataacagc 
atccctgtaa 
caggtaaagc 
cagacafcaaa 
cegfcfcfcfctfcc 
taaaagacca 
g^aatagcac 

ggcatcgtta 
tgcgcgctcg 
aattccacac 
gagctaaccc 
gtgcca«ctg 

CCCt£CCf3Ct 

at cage £ cac 
gaaeatgt9a 
gtttttccas: 
grggegaaac 
gcgc^cfccc it 
aagcgcggcg 
etccaagctg 



t. get. ct tgc c 



t 

agegtt 
acaeggaaac 
ggttategtc 
gttecgegca 



tggcagcacr, 
gtgagtactc 
egg eg tea at 
gaaaaegtte 
fcgtaaeecac 



aaeaagag^c 
cagggegatg 
tgaetfcKtac 
teegtcagga 
atategcaat 
ccatttaeeg 
aagctaaatc 
ttcgeggast 
ctgagcttga 
taaaccaaca 
aaacagtaat: 
ageaccsc tgc 
gatcccacca 
cgctaaaaag 
gecea tv.tag 
agaceegtaa 
cacacegtgc 
aacaagtgat 
gegtaatcat 
aacaCacgag 
acatcaactg 
eattaatgaa 
tectcgetca 
teaaagaegg 
geaaaaggee 
aggcfccegcc 
ccgacaggac 
gttccgacee 
ctttctcata 
ggctgt:gt:gc 
r.aactategt ctt^agtcca 
fcgg t aa cagg a c t ag cag ag 
gec t a « c t a e ggc t a cac t a 
taccttegga aaaagagttg 
tggttttttt 
ttcgatcttt 
ggtcatgaga 
taaa" caatc 
tgaggcacct 
cgtgcagata 
gcgagaceca 
c gag e g caga 
ggaagctaga 
aggcategtg 
ateaaggega 
tecgategtt 
gcataattcf 
aaccaagtea 
aegggataat 
tteggggega 
tcgtgcaccc 



ccfcacggggt 
ttatcaaaaa 
at at 



agtggtcctg 
gfcaagtagtt 



tctg ggtgageaaa aaeaggaagg 



gecgaatact 
t catgagegg 
eatttctecg 



catactet tc 
atacatatte 
aaaagtgcea 



3 

gtLCagaagta 
ettaetgeca 
ttetgagaat 
a c eg ogc c a c 
aaac tcteaa 
aaetgatett 
caaaatgecg 
ecttttcaat 
gaa tgtattc 
c 



SEQ ID NO: 43 (pTnMOD (Chicken OVep+OVg'+ENT+proins+syn polyA))



1 ctgaegegee etgtagegge geattaageg eggegggtgc ggtggttaeg egcagegtga
61 ccgctacact tgeeagegee ctagcgcecg ctcetttcgc tctcttcect tectttctcg
121 ceacgttcgc eggeatcaga ttggctattg gecattgeat acgttgtatc catateataa
181 tatgtacatt tataetggct catgt attacegcea tgttgacatt gafctafetgac
241 tagttat r.aa tagtaaecaa ttacggggte attagtccat ageccatata tggagttccg
301 cgttaeataa ettaeggtaa atggcccgcc tggctgaceg cccaacgace ccegeecafct
361 gaegtcaata atgacgtatg tccccatagt aacgeeaata gggactttec attgaegtca
421 atgggfcggag tafctsacggt aaactgecca cttggcagta catcaagtgt atcatascgcc
481 aagtaegeec ecta&tgacg fccaatgacgg taaatggcce gectggcatt atgcccagta
541 catgaeefcta tgggactfcfcc ctacttggca gtacatctac gfcatfcagtca tcgctactac
601 catggtgatg eggtttcggc agtac&tcaa tgggcgcgga tageggtteg actcaegggg
661 attfcccaagt: ctccacccca fctgacgtcaa tgggagtttg fctttggeacc aaaatcaacg
721 ggacttfceca aaatgfccgta acaactcegc cccattgacg caaatgggeg gcaggcgtgt:
781 acggtgggag gtctafcataa gcagagctcg tsctagtgaac egtcagateg cctggagacg
841 ccatecacge cgexttgace cccatagaag acaccgggac cgatccagcc tccgcggccg
901 ggaacggtge atcggaaege ggaetecccg tgccaagagt gaegtaagta ccgcctatag
961 actcxatagg eacaeeectt fcggctctcat geatgecaca ctgttcttgg ettggggect
1021 atacaeccec gettccttat gctataggtg aeggcatage ttageccata ggtgtgggtt
1081 atcgace&fct attgaccact: ccectatfcgg tgacgatact ttccattacfc aatccacaac
1141 atggctctfct gccacaacca tctctattgg ctatacgcca atactctgtc cttcagagac
1201 tgacacggac tctgfcatttt: tacaggatgg ggtcccat.CC actacttaca aatctcacata
1261 tacaacaacg ccgtccceeg tgcccgeagt ttttattaaa catagcgtgg gatctccacg
1321 cgaafcctcgg gtacgtgttc eggacatggg ctcttctccg gtageggegg agcttccaca
1381 tecgagccct^ ggtcccatgc cfcccagcggc teatggtege tcggcagctc cttgctccta
1441 acagtggagg ccagacfctag gcaeageaca atgcccacca ccaccagcgt gccgcacaag
1501 gecgtggcgg tagggtatgfc gtctgaaaat gagcgtggag attgggctcg cacggctgac
1561 geagatggaa gactfcaaggc agcggcagaa gaagatgcag g cage t gag t tgttgfcatcc
1621 tgascaagagt cagaggtaac tcccgctgcg gtgctgttaa cggtggaggg cagtgetagtc
1681 tgagcagfcae tcgttgctgc cgcgegegee accagacata atagctgaca gactaacaga
1741 cfcgttee£fct eeatgggtcc ttcetgcagt caccgtcgga ccatgtgcga actcgatact
1801 tfcacaeg&ct etctfcfcacca attctgcccc gaattacact taaaacgact caacagctta
1861 acgttggcct gccacgcafct actfcgaccgt aaaactccca ctcttacc^a acttggccgt
1921 aaectgccaa ccaaagegag aacaaaacat aacafceaaac gaatcgaccg attgrxaggt
1981 aatcgtcacc tccacaaaga gcgacteget gtataccgtt ggcatgcfcag etefctateegt
2041 fccgggcaata cgafcgcccafc tgfcacttgfct gactggtccg atattcgtga gcaaaaacga
2101 cttatggtat tgcgagcttc agtcgcacta cacggfccgtt ctgtfc&ctct fctatgagaaa
2161 gcgttccegc tttcagagca atgtfccaaag aaagctcatg accaatt:tct: agccgacctt
2221 gcgagcattc taecgagt&a caccacaccg ctcattgtca gtgatgctgg ctttaaagtg
2281 ccatggfcata aatccgttga g&agctgggt tggtactggt taagtcgagt aagaggaaaa
2341 gtacaafcatg cagacctagg agcggaaaae tggaaaccfca tcagcaacct acatgatatg
2401 tcatetagtc actcaaagac ttfcaggctat aagaggctga ctaaaagcaa tccaatctca
2461 tgecaaattc tattgtataa atctcgctcE aaaggccgaa aaaatcagcg ctcgacacgg
2521 actcatcgtc accacccgtc accfcaaaafcc tactcagegt eggcaaagga gecaegggtt
2581 ctagcaacta acttacctgt tgaaattcga acacccaaac aactcgttaa tatetattcg
2641 aagcgaatgc agattgaaga aaccfctcega gacttgaaaa gccctgccta eggactagge
2701 ctacgccasca gccgaaegag eagctcagag cgttttgata teatgetget aatcgccctg
2761 atgcttcaac taacatgttg gcfcfcgegggc gttcatgetc agaaacaagg ttgggacaag
2821 cacttccagg etaacacagt cagaaatcga aaegtactet caacagttcg cttaggcatg
2881 gaagttttgc ggcattcfcgg ctacaea&fca acaagggaag acctactcgt ggctgcaacc
2941 c t actagc tc aaaatfc tat t cacacatggt tacgetttgg ggaaattatg aggggatege
3001 tceagagega teegggatet cggg&aaagc gttggtgacc aaaggtgcct tttatcatca
3061 ctttaaaaat aaaaaacaat; taetcagtgc ctgfctataag cagcaattaa ttatgattga
3121 tgeetaeate acaacaaaaa ctgafcttaac aaatggttgg cctgcctt^ag aaagtatatt
3181 cgaacafctat cttgattata ttattgataa taa£aaaaac cttatcccta tecaagaagt
3241 gatgeefcate attggfcfcgga atgaacttga aaaaaattag ccttgaatac attactggta
3301 aggtaaaege cafctgtcage aaattgatcc aagagaacca acttaaagct fcteetgaegg
3361 aatgfctaatt etegttgace cfcgagcactg atgaatcccc taatgatttt ggcaaaaatc
3421 actaagfcfcaa ggtggataca catcttgcca tatgatcccg gtaatgtgag ttagctcact:
3481 eattaggcac cecaggcttfc acaefcctatg cttrccggctc gtatgttgfcg tggaattgtg
3541 ageggataac aatttcacac aggaaaeagc tatgaceatg atfcacgccaa gegegcaatt
3601 aacccfccacc aaagggaaea aaagctggag ctccaccgcg gtggcggccg ctctagaact
3661 agfcggatccc ccgggcfcgca gaaaa&tgcc aggtggacta tgaactcaca tccaaaggag
3721 ettgaeetga taeccgattt tcttcaaact ggggaaacaa cacaatccca caaaacagct
3781 eagagagasa ccatcactga tggctaeagc accaaggtac geaaeggcaa tccattcgac
3841 attcatctgt gacctgagca aaatgatfcta tctctccatg aatggttgct tctttccctc
3901 atgaaaaggc aatttecaca ctcaeaaeat gcaacaaaga caaacagaga acaattaatg
3961 tgefceecccc caatgtcaaa atfcgfcagtgg caaagaggag aacaaaatct caagttctga
4021 gfcaggtfctta gtgattggat aagaggcttt gaecfcgtgag cccacctgga cttcatatcc
4081 ttttggafcaa aaagfcgcfctfc: tafcaaetfctc aggtctccga gtctttattc atgagactgt
4141 tggtttaggg acagacccac aatgaaatgc ctggcatagg aaagggcagc agagecctag
4201 ctgacctttt cttgggacaa gcatfcgtcaa acaatgtgtg acaaaactat ttgtactgcfc
4261 tcgcacagct gtgctgggca gggcaatcca ttgccaccta tcccaggtaa cctfcccaacfc
4321 gcaagaagat: tgttgcttac tctctccaga aagcttetgc agactgacat gcacttcata
4381 ggtagagata acattKaefcg ggaagcacat ctat;catcat aaaaagcagg caagattttc
4441 agactttctt agtggctgaa acagaagcaa aagacgtgat taaaaacaaa atgaaacaaa






4501 aaaaatcagfc tg&tacctgt ggtgtagaca tccagcaaaa aaatattatt tgcactacea 



2 



agaafcgtaga tfctctacagt acatatgttt 
atggcactga ctaaacttca gctagsiggca 
cagtgatctc tatgtatgfcc cfcgaagaatc 



4561 tcfctgcctta agtccfccaga cttggcaagg 
4<S2 1 tcacaaaagg aaggagagaa aeaaaaajaaa 
4681 taggaaagta attctgctta acag&gafctg 

4742 atgctgtact fc& 1 1 tccccc atttttaa^t caa&cagfcgc ttfcacagagg teagaatggt 

4802 ctct ttactg tttgtcaats: ctatfcattec aatacagaac aatagcttct ataaetgaaa 

»i8£2 tatatttgct atxgtafc&fct: afcgattgtcc ctcgaaccat g&acactcct ccagctgaat 

- : Sf 2 1 ctcaeaattc ctctgtcatc tgec&ggcca ttaagttatfc catggaagat etttgaggaa 

43*82 cactgeaagt fccatatc&ta aac&catttg aaattg&gta ttgfctfcfcgca ttgfcatggag 

:l04tl ctatgttttg ctgtatccfcc agaaaaaaag tttgttatac-j agcattcaca CGcataaasa 

1u 5102 gatagattfca aatattceag ctat&ggaaa gaaagtgcgfc ctgctettea ctctagtctc 

r»l$l agttggctcc ttcacatgca tgettcttt:a tfcfcctcctat ttcgtcaaga aaataatagg 

:>22X teacgtcttg tcctcactta tgtcctgcct agca^ggefcc agatgcaegt tgtagataea 

i>2Ql agaaggatca aatgaaacag acttetggtc tgttactaca aecatagtaa taagcacact 

53^1 aactaataat fcgctaattat gttttecatc tctaaggttc ceaeattfctfc cfcgtttfcctt 

54 01 aaagatccca ttatctggtt gcaaetgaag cccaatggaa catgagcaat: afcttcecagt 

5 54 61 ettctctccc atccaacagt cctgatggat fcagesagaaca ggeagaaaac aeattgttac 

5521 ccagaattaa aaactaaeat tcgcfcctcca eteaatceaa aatggaceca ttgaaactaa 

5561 aatcfcaaccc aateecacta aatgattfcct atggcgvcaa aggtcaaact tctgaaggga 

5641 acctgtgggt gggtc<acaat teaggctata tattccccag ggctcagcca gtggatcaac 

5701 ac.aeagctag aaagctgcat tgcefcttagc aetcaagctc: aaaagacaac tcagagttca 

5761 ccatgggctc catcggcgca gcaagcatgg aatfcttgttfc tgatgtattc aaggagct.ca 

'■:o 5821 aagtccacca tgccaatgag aacatcttct actgceccafc tgecatcatg tcagctctag 

5881 ccatggt&ta ccfcgggtgca aaagacagca ccaggacaca gataaafcaag gttgttcgct: 

5941 ttgafcaaacfc tccaggatte ggagacagta fctgaagctea gcgtggcaca tctgtaaacg 

6001 tr.cactcfcfcc actxagagac acectcaacc aaatcaccaa aceaaatgat gttfcattcgt 

6061 tcagccttgc eagtagactt tafcgcfcgaag agagatacce aatcctgeca gaatacfctgc 

til 21 agtgtgtgaa ggaactgfcat agaggaggcfc tggaacctat: caactttcaa acagctgcag 

*'181 ateaagceag agagctcate aattcecggg tagaaagtca gaeaaafcgga att&tcagaa 

f 242 atgtccttca gccaagctcc gtgg&ttctc aaacfcgcaat ggttctggtt aatgccattg 

t?302 tetfccaaagg actgtgggag aaaacat:tt;a agg&fcg&aga cacacaagca afcgeett-tea 

f~361 gagtgacfcga gcaagaaagc aaacetgtgc agatgatgta ccagafctggt ttatttagag 

64.22 tggcatcaat ggctfcetgag aaaafcgaaga tcctggagct tecatttgec agtgggacaa 

64S1 tgagcatgtc ggcgcEgtcg cctgafcgaag tctcaggcct tgagcagctc gagagtataa 

_ fi f S41 tcaacttcga aaaactgact gaatggacca gttccaatgt tatggaagag aggaagatca 

t-SOl aagt^tactc acctcgcatg aagatggagg aaaaatacaa cctcacatct gtetctaatgg 

b&Sl ccatgg9cat tactgacgeg tr.cagctcct cagccaafect gtctggcatc teetcagcag 

6721 agagcctgaa gatatcteaa gcfegtceakg cagcacatgc agaaatcaat gaagcaggca 

^•781 gagaggcggt. a«gg t cagca gaggc f ggag tggat.gctgc aagcgtctct gaagaattta 



i: 841 gggctgacca tccattcctc Ctccgtatca ageacatcge aaccaacgcc gttctcttct 

6901 ttggcagatg tgtttctccg cggccagcag atgacgcacc agcagafcgac gcaccagcag 

6561 acgacgcacc agcagatgac gcaccagcag afcgacgcacc agcagat^ac gcaacaacafc 

7021 gtafccctgaa aggcfccttgt ggetggattcg gcctgcfcgga tgacgatgac- aaatttgtga 



ggggaaccjag 
3Sgcaggvgg 



7081 accaacacct gtgcggctca caccfcg^tgg aagctctcta cctagt.gt-.3c 

7142 0ct£ctt eta cacac c caa g a cc cgc eggg aggcagagga ectgeaggtg 

7202 agetgggegg gggc c c:i: gg t gcaggca 3 c c tgcagccctt ggccctggag gggtccctgc 

72 62 agaagogtgg cattgtggaa caatgctgta ccagcatctg ctccctctac cagctg^aga 

40 73 22 act&ctgcaa ctagggcgcc caaagggega atfcaccgcgg ccgccctaga ccaggcgcct 

7382 ggacccagat caectctggc caacaaaaga tcagagctct agagacctgt gtgccggctc 

7442 tttgtggatc tgctgtgcct tetagttgee agecatctgt tgetcgcccc tccccegtgc 

7S02 cttccttgac cctggaaggt gceacxccca ctgtcctttc ctaacaaaac gaggaaattg 

: S62 catcgcactg tctgagttagg tgtcactcca ttctgggggg cggggtgggg cagcacagca 

"621 agggggagga ttgggaagac aacagcaggc afcgctgggga tgcgg&gggc tctatgggta 

4 l J ^SSl cctccctctc teectetefcc tctctctccc tccctctctc tcggtacccc tcccgagggg 

7741 gggcccggta cccaatccgc cctatagtga gtegtattae gcgcgcccac cggccgtcgt 

7S01 tttacaacgt cgtgactggg aaaaccctgg cgctacccaa cttaatcgcc ttgeagcaca 

' 861 tcccccttfcc gccagctggc geaatagega agaggcccgc accgatcgcc ctteccaaca 

* 521 gttgegcage ctgaatg^cg aatggaaatt gtaagegtta atattttget aaaattcgcg 

"981 tta&afctttt gttaaatcag efceastttet aaccaatagg ccgaaategg caaaatccct 

5(5 S041 tattaaatcaa aagaas:agac cgag&£aggg t. cgagtgs: tg tt.ccagt.tfcg gaacaagagtx 

8101 ccaccattaa agaacgtgga ctccaacgcc aaag^gcgaa aaaccgf.cta r.cagggcgat 

&161 ggcccaccac reegggatea tatgacaaga tgtgtatcea ccttaactta atgatcttGa 

82 21 ccaaaatcat * aggggat r.c at cagegec. c agggtcaacg agaactaaca fctcegtcagg 

S2S1 aaagcttatg atgatgatgt gcc.taaaaac ttactcaatg gctggttatg cata&cgcaa 

8341 tacatgegaa aaaccfcaaaa gagctfcgccg ataaaaaagg ccaatttatc gctatttacc 

8401 geggcttett attgagcttg aaagacaaaic aaaatagata ggt tr.tattt gaagctaaat 

8461 ttcctttatc gtaaaaaacg ccctcttggg tfcatcaagag gg teat tat a tttcgcggaa 

8S21 taacatcatt tggtgacgaa ataaccaagc acttgcctcc tgttfcactcc cctgagcttg 



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iSSSl aggggtcaae acgaaggfcea tcgatagcag g&taataata c&gcaaaaeg ctaasccaac 

'5641 aatccaaatc cagccafccce aaatfcggtag ^gaatgaeta t&aafcaacag eaaaeagtaa 

8701 tgggecaata acaccggttg cattggtaag geccaccaat aatcccfcgta aagcaccttg 

S761 ctgatgaetc ttcgtttgga cagacatcac tecctgtaat gcaggtaaag cgatcccacc 

*821 accagccaat aaaactaaaa cagggaaaac taaccaaect tcagatacaa acgcta&aaa 

s881 ggca&atgea ctaetatecg caacaaatcc gagcagtact gccgtttttt: cgcccattta 

*;94i gtggccattc ttcctgccac aaaggcctgg aataetgagt gcaaaagacc aagacccgta 

9001 afcgaaaagcc aaccateatg ctattcatea teacgatttc tg£aa£agea ccacaccgcg 

9061 cfcggatcggc tatcaafcgcg ctgaaafcaat aatcaacaaa tggeatcgtfc aaacaagtga 

3X21 £gcataccga tcagcctttg ttceccttag tgagggttaa ttgcgegctc ggcgtaatca 

9X81 tggtcatagc cgtttcctgs: gtgaaattgfc tatccgcfcca caattccaca caacatacga 

9241 gccggaagca taaagtgtaa agcctggggt gccfcaatgag tgagctaaet: cacatfcaact 

9301 gcgtcgeget cacegccegc tctccagtcg ggaaaectgt cgtgccagcc gc&xcaatga 

afccggccaac gcgcggggag aggeggtttg cgtafctgggc gctcttecgc ttcetcgccc 

9^21 accgactcgc tgcgcceggt: cgfctcggcfcg cggcgagcgg taccagctca cfccaaaggcg 

9481 gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc 

95^1 cagcaaaagg ceaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 

9601 ccccctgacg agcat.cacaa aaatcgacgc tcaagteaga ggtggegaaa cccgacagga 

9661 ctataaagat aecaggcgt£ tccccctgga agctccctcg cgcgctcfcce tgctccgacc 

9'?21 ctgccgctta ccggatacct gtccgccttt ctececfecgg gaa^egtggc getfctctcafc 

9781 agctcacgct gtaggtacct eagctcggtg taggtcgttc gctccaagct gggctgtgtg 

^341 cacgaacccc cegtfceagcc cgaccgctgc gccccatccg gt&acfcafccg tctfcgagtcc 

^901 aaeccggtaa gacacgactt ategccactg gcageagcca ctggtaacag gatfcagcaga 

9361 gegaggtatg taggcggtge tacagagttc ttigaagtggt ggceeaacta cggctacact 

10051 agaaggacag tatttggtat ctgcgctcfcg ctgaagccag tfcacctfccgg saaaagagtt 

10081 ggtagcfcctfc gatccggcaa acaaaccacc gctggtagcg gfcggtttfcts: tgtfctgcaag 

1^3.41 cagcaga£ta cgcgcagaaa aaaaggatct caagaagatc ccttgattctx txefcacgggg 

1**201 tctgacgctc agfcggaacga aaactcacgt taagggafctt tggccafcgag actatcaaaa 

1 -261 aggatcctca cctagatccfc fcfcfcaaatfcaa aaatgaagtt ttaaatcaac ctaaagtata 

10321 tatgagtaaa ctcggtetga cagttaccaa tgcctaatca gcgaggcacc tatctcagcg 

10381 atcfcgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagac aacfcacgata 

10441 cgggagggec taceacetgg ccecagtgct. geaafcgatac cgcgagaccc acgctcaecg 

10 501 gctccagatt tateagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct 

i05>61 gcaactctac ccgeccccat ccagtctatt aattgttgcc gggaagctag agtaagtagt 

10621 tegccagtta atagtttgcg caacgttgtt gccattgce.a caggcatcgt ggtgtcacgc 

10681 tcgttcgtttg gtstggctcc at t cage tec ggtccccaac gatcaaggcg agttacatga 

10741 teccccatgt. tgcgcaaaaa ageggttage tccttcggtc ctccgatcgt fcgtcagaagt 

10S01 aagttggccg cagtgeeate actcat:ggt:t atggcagcac tgeataatte fccttactgt:c 

10S61 atgccatccg taagacgett ttctgegact ggtgagtact caaccaagtc actctgagaa 

10921 tagtgtatgc ggegaccgag ttgctcttgc ccggcgtcaa tacgggataa caccgcgcca 

10981 catagcagaa cttcaaaagt gctcatcatt: ggaaaacgfct etteggggeg aaaactctca 

11041 aggatcttac cgctgttgag atccagtccg atgtaaccca ctcgtgcacc caactgatct 

11101 tcagcatcEt ttactctcac cagegtttet: gggtgagcaa aaacaggaag gcaaaatgec 

11161 gcaaaaaagg gaar.aagggc gaeaeggaaa tgttgaatac tcacactctt cctttttcaa 

1.221 tatcatcgaa geatttatca gggtfcafctgt: etcatgageg gacacatatc tgaatgtafiit: 

fcagaaaaata aacaaatagg ggttccgcgc acatttcccc gsaaagtgcc ac 



Claims



1. A vector comprising:

a) a modified transposase gene operably linked to a first promoter, wherein the nucleic acid sequence 3' to the
first promoter comprises the sequence as set forth in SEQ ID NO: 13, wherein SEQ ID NO: 13 contains the
Kozak sequence and a start codon for the transposase, and wherein at least one of the first twenty codons of
the transposase gene are modified from the wild-type sequence by changing a nucleotide at a third base position
of the codon to an adenine or thymine without modifying the amino acid encoded by the codon; and
b) one or more genes of interest operably-linked to one or more additional promoters, and wherein the one or
more genes of interest and their operably-linked promoters are flanked by transposase insertion sequences
recognized by the transposase encoded by the modified transposase gene, wherein the promoter which is
operably linked to the gene of interest is selected from the group consisting of an ovalbumin promoter, a
conalbumin promoter, a vitellogenin promoter or an ovomucoid promoter.

2. The vector of claim 1, wherein the modified transposase gene comprises an adenine or thymine at the third position
in each of codons 2-10 of the modified transposase gene.

3. The vector of claim 1, comprising the sequence as set forth in SEQ ID NO: 1.

4. The vector of claim 1, wherein the transposase is a Tn10 transposase.

5. The vector of claim 1 or claim 4, wherein the first promoter is selected from the group consisting of a constitutive
promoter and an inducible promoter.

6. The vector of claim 5, wherein the inducible promoter is selected from the group consisting of an ovalbumin promoter,
a conalbumin promoter, a vitellogenin promoter or an ovomucoid promoter.

7. The vector of claim 1, further comprising a polyA sequence operably linked to the transposase gene.

8. The vector of claim 7, wherein the polyA sequence is a conalbumin polyA sequence.

9. The vector of claim 1 or claim 7, further comprising two stop codons operably-linked to the transposase gene.

10. The vector of claim 1, wherein a first gene of interest is operably-linked to a second promoter and a second gene
of interest is operably-linked to a third promoter.

11. The vector of claim 1, wherein a first and a second gene of interest are operably-linked to a second promoter.

12. The vector of claim 1, further comprising an enhancer operably-linked to the one or more genes of interest.

13. The vector of claim 12 wherein the enhancer comprises at least a portion of an ovalbumin enhancer. 

14. The vector of claim 1, further comprising an egg directing sequence operably-linked to the one or more genes of
interest.

15. The vector of claim 14, wherein the egg directing sequence is an ovalbumin signal sequence, an ovomucoid signal
sequence or a vitellogenin targeting sequence.

16. Use of a vector according to any one of claims 1-15 for producing a non-human transgenic animal.

17. The use of claim 16, wherein the vector is administered via an intratesticular, intraarterial, intraoviductal or
intraembryonic route.

18. The use of claim 16, wherein the animal is an avian animal.

19. The use of claim 18, wherein the avian animal is a chicken or a quail.

20. Use of a vector according to any of the claims 1-15 to be administered to an animal for producing a desired protein.

21. The use of claim 20, wherein the animal is an egg-laying animal, and the administration is intraoviductal such that
the desired protein produced by the at least one gene of interest is isolated from the egg white of eggs laid by the
egg-laying animal.

22. The use of claim 20, wherein the vector further comprises a TAG sequence and wherein the desired protein can be
purified using the TAG sequence.

23. The use of claim 22, wherein the TAG sequence comprises: (i) a sequence that encodes a polypeptide that functions
as a purification handle; (ii) a cleavage site; and (iii) a polynucleotide spacer.

24. The use of claim 22, wherein the TAG sequence comprises a polynucleotide sequence shown in SEQ ID NO: 22.

25. The use of claim 20, wherein the desired protein is a lytic protein, proinsulin or a human growth hormone.

26. The use of claim 20, wherein the vector further comprises a second gene of interest operably-linked to a third
promoter and wherein the genes of interest encode antibody polypeptides.

27. The vector of claim 1 or 6, wherein the inducible promoter comprises the sequence as set forth in SEQ ID NO: 17,
SEQ ID NO: 40, or nucleic acids 4050-4938 of SEQ ID NO: 30.

28. The vector of claim 14, wherein the egg directing sequence comprises at least one of the sequences as set forth in
SEQ ID NO: 18, nucleic acids 4960-5112 of SEQ ID NO: 3, nucleic acids 4943-5092 of SEQ ID NO: 4, nucleic acids
4958-6115 of SEQ ID NO: 23, or nucleic acids 4945-6092 of SEQ ID NO: 30.

29. The vector of claim 7, wherein the polyA sequence comprises at least one of the sequences as set forth in SEQ ID
NO: 28, SEQ ID NO: 33, or nucleic acids 2995-3410 of SEQ ID NO: 1.

30. The vector of claims 1-15, wherein the modified transposase gene comprises an A or a T at the third position in each
of codons 2-10 of the modified transposase gene.

31. The use of claim 20, wherein the vector is to be administered via an intratesticular, intraarterial, intraperitoneal,
intravenous, intraoviductal, intraembryonic, nasal, or pronuclear route.
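
The third-base modification recited in claims 1, 2 and 30 can be illustrated with a short Python sketch. It is only an illustration under stated assumptions, not the method used to construct SEQ ID NO: 13: the function names `split_codons`, `translate` and `at_wobble` and the 18-nucleotide example sequence are hypothetical, and the sketch assumes the standard genetic code. For each of codons 2-10 it replaces a third-position g or c with an adenine or thymine only when the encoded amino acid stays the same.

```python
# Standard genetic code in the conventional t/c/a/g ordering (64 codons).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def split_codons(cds):
    """Split a coding sequence into complete codons plus any trailing bases."""
    cds = cds.lower()
    whole = len(cds) - len(cds) % 3
    return [cds[i:i + 3] for i in range(0, whole, 3)], cds[whole:]


def translate(cds):
    """Translate the complete codons of a coding sequence (stop codon = '*')."""
    codons, _ = split_codons(cds)
    return "".join(CODON_TABLE[c] for c in codons)


def at_wobble(cds, first=2, last=10):
    """Put a or t at the third base of codons first..last (1-based, inclusive)
    wherever a synonymous codon exists; other codons are left unchanged."""
    codons, tail = split_codons(cds)
    for n in range(first - 1, min(last, len(codons))):
        codon = codons[n]
        if codon[2] in "at":
            continue  # third base is already adenine or thymine
        for base in "at":
            candidate = codon[:2] + base
            if CODON_TABLE[candidate] == CODON_TABLE[codon]:
                codons[n] = candidate
                break
    return "".join(codons) + tail


# Hypothetical example sequence (not taken from the patent).
original = "atgctgaccggcaaatgg"
modified = at_wobble(original)
print(modified)  # atgctaacaggaaaatgg
print(translate(original) == translate(modified))  # True
```

In this example the final codon (tgg, tryptophan) is left unchanged because no synonymous codon ending in a or t exists for it.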



Patentansprüche

1. Vektor umfassend:

a) ein modifiziertes Transposasegen, das operativ mit einem ersten Promoter verbunden ist, wobei die
Nukleinsäuresequenz 3' von dem ersten Promoter die in SEQ ID NO: 13 dargelegte Sequenz umfasst, wobei SEQ
ID NO: 13 die Kozak-Sequenz und ein Startcodon für die Transposase enthält, und wobei mindestens eines
der ersten zwanzig Codons des Transposasegens gegenüber der Wildtypsequenz verändert ist, indem ein
Nukleotid an der Position einer dritten Base des Codons gegen ein Adenin oder Thymin ausgetauscht ist, ohne
dass die von dem Codon kodierte Aminosäuresequenz verändert ist, und

b) ein oder mehrere Gen(e) von Interesse, das/die operativ mit einem oder mehreren zusätzlichen Promoter(en)
verbunden ist/sind, und wobei das eine oder die mehreren Gene von Interesse und ihre operativ verbundenen
Promotoren durch Transposaseinsertionssequenzen flankiert werden, die von der Transposase, die von
dem veränderten Transposasegen kodiert wird, erkannt werden, wobei der Promoter, der operativ mit dem Gen
von Interesse verbunden ist, aus der Gruppe bestehend aus einem Ovalbuminpromotor, einem
Conalbuminpromotor, einem Vitellogeninpromotor oder einem Ovomucoidpromotor ausgewählt ist.

2. Vektor nach Anspruch 1, wobei das modifizierte Transposasegen ein Adenin oder Thymin an der dritten Position in
jedem der Codons 2-10 des veränderten Transposasegens umfasst.

3. Vektor nach Anspruch 1, umfassend die in SEQ ID NO: 1 dargelegte Sequenz.

4. Vektor nach Anspruch 1, wobei die Transposase eine Tn10-Transposase ist.

5. Vektor nach Anspruch 1 oder nach Anspruch 4, wobei der erste Promoter aus der Gruppe bestehend aus einem
konstitutiven Promoter und einem induzierbaren Promoter ausgewählt ist.

6. Vektor nach Anspruch 5, wobei der induzierbare Promotor aus der Gruppe bestehend aus einem
Ovalbuminpromotor, einem Conalbuminpromotor, einem Vitellogeninpromotor oder einem Ovomucoidpromotor ausgewählt ist.

7. Vektor nach Anspruch 1, weiterhin umfassend eine Poly-A-Sequenz, die operativ mit dem Transposasegen
verbunden ist.

8. Vektor nach Anspruch 7, wobei die Poly-A-Sequenz eine Conalbumin-Poly-A-Sequenz ist.

9. Vektor nach Anspruch 1 oder nach Anspruch 7, weiterhin umfassend zwei Stopcodons, die operativ mit dem
Transposasegen verbunden sind.





10. Vektor nach Anspruch 1, wobei ein erstes Gen von Interesse operativ mit einem zweiten Promoter verbunden ist
und ein zweites Gen von Interesse operativ mit einem dritten Promoter verbunden ist.

11. Vektor nach Anspruch 1, wobei ein erstes und ein zweites Gen von Interesse operativ mit einem zweiten Promoter
verbunden sind.

12. Vektor nach Anspruch 1, weiterhin umfassend einen Enhancer, der operativ mit dem einen oder den mehreren
Gen(en) von Interesse verbunden ist.

13. Vektor nach Anspruch 12, wobei der Enhancer mindestens einen Teil eines Ovalbumin-Enhancers umfasst.

14. Vektor nach Anspruch 1, weiterhin umfassend eine Ei-dirigierende Sequenz, die operativ mit dem einen oder den
mehreren Gen(en) von Interesse verbunden ist.

15. Vektor nach Anspruch 14, wobei die Ei-dirigierende Sequenz eine Ovalbumin-Signalsequenz, eine
Ovomucoid-Signalsequenz oder eine Vitellogenin-Zielsequenz ist.

16. Verwendung eines Vektors nach einem der Ansprüche 1-15 zur Herstellung eines nicht-humanen transgenen Tieres.

17. Verwendung nach Anspruch 16, wobei der Vektor über einen intratestikulären, intraarteriellen, intraoviductalen oder
intraembryonalen Weg verabreicht wird.

18. Verwendung nach Anspruch 16, wobei das Tier ein Vogel ist.

19. Verwendung nach Anspruch 18, wobei der Vogel ein Hühnchen oder eine Wachtel ist.

20. Verwendung eines Vektors nach einem der Ansprüche 1-15 zur Verabreichung an ein Tier zur Herstellung eines
gewünschten Proteins.

21. Verwendung nach Anspruch 20, wobei das Tier ein eierlegendes Tier ist, und die Verabreichung intraoviductal ist,
so dass das gewünschte Protein, das von dem mindestens einen Gen von Interesse hergestellt wurde, aus dem
Eiweiß von Eiern, die von dem eierlegenden Tier gelegt wurden, isoliert wird.

22. Verwendung nach Anspruch 20, wobei der Vektor weiterhin eine TAG-Sequenz umfasst und wobei das gewünschte
Protein unter Verwendung der TAG-Sequenz aufgereinigt werden kann.

23. Verwendung nach Anspruch 22, wobei die TAG-Sequenz umfasst: (i) eine Sequenz, die ein Polypeptid kodiert, das
als Mittel für die Aufreinigung fungiert; (ii) eine Spaltstelle; und (iii) einen Polynukleotidspacer.

24. Verwendung nach Anspruch 22, wobei die TAG-Sequenz eine in SEQ ID NO: 22 gezeigte Polynukleotidsequenz
umfasst.

25. Verwendung nach Anspruch 20, wobei das gewünschte Protein ein lytisches Protein, Proinsulin oder ein humanes
Wachstumshormon ist.

26. Verwendung nach Anspruch 20, wobei der Vektor weiterhin ein zweites Gen von Interesse umfasst, das operativ
mit einem dritten Promoter verbunden ist, und worin die Gene von Interesse Antikörperpolypeptide kodieren.

27. Vektor nach Anspruch 1 oder 6, wobei der induzierbare Promoter die Sequenz, wie in SEQ ID NO: 17, SEQ ID NO:
40 oder den Nukleinsäuren 4050-4938 der SEQ ID NO: 30 dargelegt, umfasst.

28. Vektor nach Anspruch 14, wobei die Ei-dirigierende Sequenz mindestens eine der Sequenzen, wie in SEQ ID NO:
18, den Nukleinsäuren 4960-5112 der SEQ ID NO: 3, den Nukleinsäuren 4943-5092 der SEQ ID NO: 4, den
Nukleinsäuren 4958-6115 der SEQ ID NO: 29 oder den Nukleinsäuren 4945-8092 der SEQ ID NO: 30 dargelegt,
umfasst.

29. Vektor nach Anspruch 7, wobei die Poly-A-Sequenz mindestens eine der Sequenzen, wie in SEQ ID NO: 28, SEQ
ID NO: 33 oder den Nukleinsäuren 2995-3410 der SEQ ID NO: 1 dargelegt, umfasst.

30. Vektor nach den Ansprüchen 1-15, wobei das modifizierte Transposasegen ein A oder ein T an der dritten Position
in jedem der Codons 2-10 des modifizierten Transposasegens umfasst.

31. Verwendung nach Anspruch 20, wobei der Vektor über einen intratestikulären, intraarteriellen, intraperitonealen,
intravenösen, intraoviductalen, intraembryonalen, nasalen oder pronukleären Weg verabreicht werden soll.

Revendications

1. Vecteur comprenant :

a) un gène de transposase modifié lié de manière fonctionnelle à un premier promoteur, où la séquence d'acide
nucléique en 3' par rapport au premier promoteur comprend la séquence telle que représentée par SEQ ID
NO : 13, où SEQ ID NO : 13 contient la séquence Kozak et un codon de départ pour la transposase, et où au
moins l'un des vingt premiers codons du gène de transposase est modifié par rapport à la séquence de type
sauvage en changeant un nucléotide au niveau d'une troisième position de base du codon en une adénine ou
une thymine sans modifier l'acide aminé codé par le codon, et

b) un ou plusieurs gènes d'intérêt liés de manière fonctionnelle à un ou plusieurs promoteurs supplémentaires,
et où les un ou plusieurs gènes d'intérêt et leurs promoteurs liés de manière fonctionnelle sont flanqués par
des séquences d'insertion de transposase reconnues par la transposase codée par le gène de transposase
modifié, où le promoteur qui est lié de manière fonctionnelle au gène d'intérêt est choisi dans le groupe constitué
d'un promoteur de l'ovalbumine, d'un promoteur de la conalbumine, d'un promoteur de la vitellogénine ou d'un
promoteur de l'ovomucoïde.

2. Vecteur selon la revendication 1, dans lequel le gène de transposase modifié comprend une adénine ou une thymine
au niveau de la troisième position dans chacun des codons 2 à 10 du gène de transposase modifié.

3. Vecteur selon la revendication 1, comprenant la séquence telle que représentée par SEQ ID NO : 1.

4. Vecteur selon la revendication 1, dans lequel la transposase est une transposase Tn10.

5. Vecteur selon la revendication 1 ou la revendication 4, dans lequel le premier promoteur est choisi dans le groupe
constitué d'un promoteur constitutif et d'un promoteur inducible.

6. Vecteur selon la revendication 5, dans lequel le promoteur inducible est choisi dans le groupe constitué d'un
promoteur de l'ovalbumine, d'un promoteur de la conalbumine, d'un promoteur de la vitellogénine ou d'un promoteur
de l'ovomucoïde.

7. Vecteur selon la revendication 1, comprenant en outre une séquence polyA liée de manière fonctionnelle au gène
de transposase.

8. Vecteur selon la revendication 7, dans lequel la séquence polyA est une séquence polyA de la conalbumine.

9. Vecteur selon la revendication 1 ou la revendication 7, comprenant en outre deux codons stop liés de manière
fonctionnelle au gène de transposase.

10. Vecteur selon la revendication 1, dans lequel un premier gène d'intérêt est lié de manière fonctionnelle à un deuxième
promoteur et un deuxième gène d'intérêt est lié de manière fonctionnelle à un troisième promoteur.

11. Vecteur selon la revendication 1, dans lequel un premier et un deuxième gènes d'intérêt sont liés de manière
fonctionnelle à un deuxième promoteur.

12. Vecteur selon la revendication 1, comprenant en outre un amplificateur lié de manière fonctionnelle à l'au moins un
gène d'intérêt.

13. Vecteur selon la revendication 12, dans lequel l'amplificateur comprend au moins une partie d'un amplificateur de
l'ovalbumine.

14. Vecteur selon la revendication 1, comprenant en outre une séquence dirigeante de l'oeuf liée de manière fonctionnelle
à l'au moins un gène d'intérêt.

15. Vecteur selon la revendication 14, dans lequel la séquence dirigeante de l'oeuf est une séquence signal de
l'ovalbumine, une séquence signal de l'ovomucoïde ou une séquence de ciblage de la vitellogénine.

16. Utilisation d'un vecteur selon l'une quelconque des revendications 1 à 15, pour produire un animal transgénique
non humain.

17. Utilisation selon la revendication 16, où le vecteur est administré par une voie intratesticulaire, intra-artérielle,
intra-oviductale ou intra-embryonnaire.

18. Utilisation selon la revendication 16, où l'animal est un animal aviaire.

19. Utilisation selon la revendication 18, où l'animal aviaire est un poulet ou une caille.

20. Utilisation d'un vecteur selon l'une quelconque des revendications 1 à 15, pour être administré à un animal pour
produire une protéine souhaitée.

21. Utilisation selon la revendication 20, où l'animal est un animal qui pond des oeufs et l'administration est
intra-oviductale, de telle manière que la protéine souhaitée produite par l'au moins un gène d'intérêt est isolée du blanc
des oeufs pondus par l'animal pondant des oeufs.

22. Utilisation selon la revendication 20, où le vecteur comprend en outre une séquence TAG et où la protéine souhaitée
peut être purifiée en utilisant la séquence TAG.

23. Utilisation selon la revendication 22, où la séquence TAG comprend : (i) une séquence qui code pour le polypeptide
qui fonctionne comme une poignée de purification ; (ii) un site de clivage ; et (iii) un espaceur polynucléotidique.

24. Utilisation selon la revendication 22, où la séquence TAG comprend une séquence polynucléotidique représentée
par SEQ ID NO : 22.

25. Utilisation selon la revendication 20, où la protéine souhaitée est une protéine lytique, la proinsuline ou une hormone
de croissance humaine.

26. Utilisation selon la revendication 20, où le vecteur comprend en outre un deuxième gène d'intérêt lié de manière
fonctionnelle à un troisième promoteur et où les gènes d'intérêt codent pour des polypeptides d'anticorps.

27. Vecteur selon la revendication 1 ou 6, où le promoteur inducible comprend la séquence telle que représentée par
SEQ ID NO : 17, SEQ ID NO : 40 ou les acides nucléiques 4050 à 4938 de SEQ ID NO : 30.

28. Vecteur selon la revendication 14, dans lequel la séquence dirigeante de l'oeuf comprend au moins l'une des
séquences telles que représentées par SEQ ID NO : 18, les acides nucléiques 4980 à 5112 de SEQ ID NO : 3, les
acides nucléiques 4943 à 5092 de SEQ ID NO : 4, les acides nucléiques 4958 à 6115 de SEQ ID NO : 29 ou les
acides nucléiques 4945 à 8092 de SEQ ID NO : 30.

29. Vecteur selon la revendication 7, dans lequel la séquence polyA comprend au moins l'une des séquences telles
que représentées par SEQ ID NO : 28, SEQ ID NO : 33 ou les acides nucléiques 2995 à 3410 de SEQ ID NO : 1.

30. Vecteur selon les revendications 1 à 15, dans lequel le gène de transposase modifié comprend un A ou un T au
niveau de la troisième position dans chacun des codons 2 à 10 du gène de transposase modifié.

31. Utilisation selon la revendication 20, où le vecteur doit être administré par une voie intratesticulaire, intra-artérielle,
intrapéritonéale, intraveineuse, intra-oviductale, intra-embryonnaire, nasale ou pronucléaire.

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